MIDAS

  • Open Search Box
  • Toggle side widget area
  • No menu assigned!

research data analysis michigan

Spotlight on Research

  • April 18, 2024 MIDAS Affiliate Faculty member Nicholas Kotov named 2024 American Association for the Advancement of Science Fellow false https://news.engin.umich.edu/2024/04/2024-aaas-fellows-include-three-michigan-engineering-professors/
  • April 16, 2024 Atlas of the human ovary offers huge potential for new treatments – new study from MIDAS affiliate Jun Li and team false https://www.science.org/doi/10.1126/sciadv.adm7506
  • April 15, 2024 New study from MIDAS affiliate Puneet Manchanda explores video game addiction rates false https://record.umich.edu/articles/new-study-explores-video-game-addiction-rates/
  • April 10, 2024 MIDAS affiliate Christian Sandvig recipient of U-M 2023 public engagement award false https://record.umich.edu/articles/sandvig-dworkin-receive-2023-public-engagement-awards/
  • February 21, 2024 MIDAS affiliate Anne Draelos among U-M researchers named Sloan Research Fellows false https://record.umich.edu/articles/three-u-m-researchers-named-sloan-research-fellows/

Upcoming Events

  • May 21, 2024 BRISP Conference: Advancing Behavioral Science through AI and Digital Health https://michr.umich.edu/new-events/2024/5/21/advancing-behavioral-science-through-ai-and-digital-health
  • May 29, 2024 Generative AI Tutorial Series – Data Analysis: Visualization and presenting information https://midas.umich.edu/generative-ai-tutorial-series/#data-analysis-iii
  • June 3, 2024 – June 7, 2024 Introduction to Data Science and AI Summer Academy 2024 https://midas.umich.edu/workshops/intro-ds-summer-academy-24/
  • June 17, 2024 – June 21, 2024 Introduction to Data Science and AI High School Summer Camp 2024 https://midas.umich.edu/summer-camp-2024/
  • July 8, 2024 – July 12, 2024 Biomedical Data Science Summer Academy 2024 https://midas.umich.edu/workshops/biomedical-academy-2024/
  • August 12, 2024 – February 16, 2024 MIDAS-ICPSR Social Data Science Summer Academy 2024 https://midas.umich.edu/workshops/social-data-sci-academy-24/

The Michigan Institute for Data Science strengthens the University of Michigan’s preeminence in Data Science and Artificial Intelligence and enables their transformative use in a wide range of research disciplines to achieve lasting societal impact.

MIDAS By The Numbers

Sign Up for the MIDAS Newsletter

University of Michigan Office of Research

Copyright © 2020 The Regents of the University of Michigan

  • U-M Gateway
  • U-M Non-discrimination Statement
  • Governance and Advisory Boards
  • Annual Highlights
  • Newsletters
  • Research Pillars
  • MIDAS Funded Projects
  • Generative AI
  • AI in Science Program
  • Michigan Data Science Fellows
  • Graduate Certificate
  • Summer Academies
  • Future Leaders Summit
  • Summer Camp
  • Other Programs
  • Mini-Symposia
  • 2024 ADSA Annual Meeting
  • Past Events
  • Academic Partnerships
  • Data Science for Social Good
  • Working Groups
  • Staff Collective for Data Science
  • NLP+CSS 201
  • Student Organizations Council
  • Projects for Students
  • Generative AI Resources
  • Reproducibility Hub
  • Research Datasets
  • Drug Discovery IUCRC

research data analysis michigan

Data Analytics

Data science research uses principles from computation, machine learning, statistics, and mathematics, to develop methods to analyze data and gain insight and knowledge about underlying systems to improve decision making. Data analytics is highly interdisciplinary with significant overlap with many areas, including human system integration, optimization, and stochastic systems.

This area includes:

Big Data Analytics:  Techniques for analyzing large amounts of data to visualize patterns and discover the underlying principles governing the operational performance of industrial systems.

Predictive Analytics:  Methods for predicting future outcomes, assessing and quantifying uncertainty about the future behavior of systems, and identifying the risk associated with decisions under varying environmental conditions.

Adaptive Learning:  Fostering adaptive learning processes that use one or more sources of data collected over time to optimize dynamic decision-making and risk management.

RELATED NEWS

U-M IOE takes home awards from the INFORMS Annual Meeting

U-M IOE takes home awards from the INFORMS Annual Meeting

Helping people get back to work using deep learning in the occupational health system

Helping people get back to work using deep learning in the occupational health system

Preventing prescription dispensing errors using machine intelligence

Preventing prescription dispensing errors using machine intelligence

IOE Faculty Launch New Data Analytics Workshop

IOE Faculty Launch New Data Analytics Workshop

Salar Fattahi introduces a new data analytics course for IOE

Salar Fattahi introduces a new data analytics course for IOE

IOE undergraduates prep for a modern workforce with expanded data analytics courses and support

IOE undergraduates prep for a modern workforce with expanded data analytics courses and support

Salar Fattahi joins the IOE faculty

Salar Fattahi joins the IOE faculty

Seth Guikema elected President of the Society for Risk Analysis

Seth Guikema elected President of the Society for Risk Analysis

Mitigating uncertainties in remote computer numerical control using data-driven transfer learning

Mitigating uncertainties in remote computer numerical control using data-driven transfer learning

Jon Lee selected for Centre de Recherches Mathématiques Scholar-in-Residence Program

Jon Lee selected for Centre de Recherches Mathématiques Scholar-in-Residence Program

Home

Search form

Data research analyst.

Company Name:  Georgetown University Department:  Center for Security and Emerging Technology Application Deadline:  Jul 7, 2023 Notes:  This dynamic role serves as a bridge between data and analysis teams and combines knowledge of research methods and data analysis skills. Link to additional information and/or application:  Additional Information and/or Application

CSCAR

  • Open Search Box
  • Toggle side widget area
  • No menu assigned!
  • About CSCAR

Consulting for Statistics, Computing and Analytics Research (CSCAR) provides individualized support and training to University of Michigan researchers in a variety of areas relating to the management, collection, and analysis of data. CSCAR also supports the use of technical software and advanced computing in research. Researchers from nearly all disciplines at U-M have made effective use of our services.

Our scope is broad, including formal statistical analysis, management of large data sets, development and optimization of computing code, data visualization, predictive modeling, geographic information systems, and text analysis, among other areas.  See our areas of expertise page  for more details.

Many of our services are free to the U-M research community.  Contact us at [email protected]  with administrative questions.  Technical questions can be sent to the email addresses listed on our  contacts page .

The CSCAR Team

research data analysis michigan

Chris Andrews

research data analysis michigan

Jesse Cordoba

research data analysis michigan

Josh Errickson

research data analysis michigan

Brenda Gillespie

research data analysis michigan

Abner Heredia Bustos

research data analysis michigan

Carol Janney

research data analysis michigan

Hyungjin Myra Kim

research data analysis michigan

Corey Powell

research data analysis michigan

Kerby Shedden

Sign up for our newsletter.

  • U-M Gateway
  • U-M Non-discrimination Statement

University of Michigan Office of Research

Copyright © 2017 The Regents of the University of Michigan

Your browser is ancient! Upgrade to a different browser to experience this site.

Data Science for Health Research

Description.

In Data Science for Health Research, learn to organize and visualize health data using statistical analysis in programs like R. Explore how to translate data, interpret statistical models, and predict outcomes to help make data-informed decisions within the public health field.

research data analysis michigan

U-M Credit Eligible

Instructors.

research data analysis michigan

Philip S. Boonstra, PhD

Associate Professor, Biostatistics

research data analysis michigan

Bhramar Mukherjee PhD

Professor of Epidemiology

Courses (3)

research data analysis michigan

Arranging and Visualizing Data in R

research data analysis michigan

Linear Regression Modeling for Health Data

research data analysis michigan

Logistic Regression and Prediction for Health Data

Know someone who would like this course? Share it with them!

Share on Facebook

Share on Twitter

Share on LinkedIn

  • Departments and Units
  • Majors and Minors
  • LSA Course Guide
  • LSA Gateway

Search: {{$root.lsaSearchQuery.q}}, Page {{$root.page}}

  • News and Events
  • Computing Resources
  • Diversity, Equity, and Inclusion
  • Provide Climate Feedback
  • Undergraduate Students
  • Ph.D. Students
  • Master's Students
  • Alumni and Friends

Department of Statistics

  • Undergraduate FAQs
  • Statistics Grad Student Tutors
  • Transfer Credit
  • Undergraduate Programs
  • Michigan Undergraduate Students of Statistics (MUgSS)
  • Undergraduate Courses
  • Undergraduate Research
  • Statistics Ph.D. Student Council
  • FAQs for Current Students
  • Graduate Student Spotlight
  • Ph.D. Program
  • Graduate Courses
  • Alumni Spotlight
  • Prospective Ph.D. Students
  • Graduate Resources
  • Master's Degree Programs
  • Prospective Master's Students - Admissions
  • Frequently Asked Questions (FAQs)
  • Statistics Alumni
  • Giving Opportunities
  • Statistics PhD Alumni
  • Statistics Career Placements

Undergraduate research in statistics provides opportunities for gaining experience in data analysis, reading and writing about statistics, and collaboration with Statistics faculty mentors and their research teams. By doing an undergraduate research project, you will develop a deeper understanding of statistics, whether as a first/second year student considering a statistics major, or as a junior/senior considering graduate school and other career options. It is recommended that students considering graduate school participate in research during the course of their academic studies.

The two largest programs for Undergraduate research in data science and statistics are the Undergraduate Research Program in Statistics and the honors thesis. There are also other faculty research projects that include undergraduates that are not in either of these programs. Other statistics research-related activities involving undergraduates include the following:

  • The Undergraduate Research Opportunity Program (UROP) provides research opportunities for first and second year students.
  • The annual  Michigan Student Symposium for Interdisciplinary Statistical Sciences   (MSSISS) . MSSISS provides a forum for presenting completed research projects, and an opportunity to see the range and scope of statistical activity across the University of Michigan. Most of the research projects are carried out by graduate students, but undergraduates are welcome to participate and many have!
  • The Statistics department occasionally runs a data mining competition. 
  • A relevant national forum is the free  Electronic Undergraduate Statistics Research Conference , and the associated  Undergraduate Statistics Project Competition .
  • The   Center for Statistics, Computing, and Analytics Research   (CSCAR) sometimes employs undergraduates. Email  [email protected]  if you are interested in learning more about opportunities for involvement with CSCAR.

Undergraduate Research Program in Statistics (URPS)

URPS is a competitive program where Statistics faculty offer undergraduate research projects for the winter semester. 

The URPS 24 information session is Friday, December 1 from 3:30-4:30pm in 340 West Hall.

The application deadline is Tuesday, December 5 at 11:59pm. The application can be found at this link .

Students who will be informed by Monday, December 11 if they are selected, subject to an interview with the project supervisor. UPDATE: 12/12/23 No students have been informed yet about the results of the application. There were more applications than anticpiated this year and it's taking longer than previous years. We are hopeful that the results will go out today. Please be patient in waiting for the results.

Undergraduate Reserach Program in Statistics Winter 2024 Projects - URPS 24

Past Projects

Writing an Honors Thesis

An honors thesis provides an opportunity for eligible students to carry out faculty-supervised research in the senior year. The application process and requirements for the Statistics ,   Data Science , and Informatics honors programs are described on the department website.  Students are encouraged to contribute their thesis to the   archive of honors theses   at the University of Michigan Library.

Past Honors Theses

  • Xinpei Shen, Data Science - A Dimension Reduction Approach to Multivariate Mediation Analysis
  • Weizhe Sun,  Statistics -   Model Based Inference of Stochastic Volatility via Iterated Filtering
  • Jiayi Xu, Data Science - Investigating Measles Dynamics in the Pre-Vaccination Era: A POMP Model Approach
  • Zuyuan Han, Data Science - Signature Methods in Variance Swap Pricing
  • Yiwen (Oliver) Wu, Data Science - Assessment of Privacy in Synthetic Data
  • Chen Shang, Statistics and Mathematics -  Mat ́ern Models for Graphs: Definition and Inference
  • Mingxuan Ge, Statistics and Mathematics -  Redistribution of Equity Returns After The Minimum Wage Policy
  • Will Schmutz, Data Science -  Statistically Ranking Teams in the English Premier League
  • Xinyi Xie, Statistics and Mathematics -  Logistic Regression With Log-Contrast Transformation
  • Yiling Huang, Statistics and Mathematics -  Balance Assessment of Matched Data with Multiple Treatment Levels
  • Chenxi Fan, Statistics - An evaluation of information criteria for model selection in quasi-likelihood regression, with application to modeling COVID mortality and case incidence in the United States
  • Siqi Li, Statistics - Local False Discovery Rates in the Multi-Parameter Case, with Application to Epigenetics of Human Growth
  • Wanqi Liang, Data Science - An Applet and Tutorial for Calculating the Sample Size (and Power) for a Clustered Sequential, Multiple Assignment Randomized Trial
  • Juejue Wang, Statistics - Comparison of Document Co-clustering aslgorithms and Application of Single-cell RNA-seq Data Clustering to Twitter Data
  • Chao Peter Yang, Data Science - The Classical-Romantic Dichotomy: A Machine Learning Approach
  • Ziyang Shao, Statistics - College Ranking Based on Pairwise Preferences
  • Haoyu Chen, Statistics -  Kernel Methods for Activation Energy Prediction
  • We Han, Statistics -  Argo Data Mean Field Modeling
  • Jiahui Ji, Statistics -  NYC Optimal Transport and Ridesharing
  • Xiaotong Yang, Statistics -  Fitting mechanistic models to Daphnia panel data within a panelPOMP framework
  • Shuaiji Li, Statistics - Auto Sales Prediction with attention to the Parable of the boiled frog: Functional Data Analysis and Time Series Forecasting
  • Zifan Li, Statistics - Perturbation Algorithms for Adversarial Online Learning
  • Tianwen Ma, Statistics - A Functional Data Analysis Approach to Looking at Handwriting Data
  • Xige Zhang, Statistics - Robustness of the Contextual Bandit Algorithm to A Physical Activity Motivation Effect
  • Rong Zhou, Statistics - The Comparison of ACI and MCB Methods for Choosing a Set that Contains the Optimal Dynamic Treatment Regime
  • Xinyan Han, Statistics - An Empirical Comparison of Various Online Binary Classification Algorithms
  • Hwanwoo Kim, Statistics - A Sample Size Calculator for SMART Pilot Studies
  • Yuchen Lin, Statistics - Auto Car Sales Prediction: A Statistical Study Using Functional Data Analysis and Time Series 
  • Kelsey Pakkala, Statistics - A Functional Data Analysis Approach to Women’s Health Screening Adherence for Breast Cancer and Cervical Cancer  
  • Emily Slade, Statistics - Functional Data Analysis in Cephalometric Tracing and Mandibular Examination
  • Ben Charoenwong, Statistics - An Exploration of Simple Optimized Technical Trading Strategies
  • Matthew Lomont, Statistics - Detecting Active Pathways in Gene Sets
  • Xuanzhong Wang, Statistics - An Exploration of Influential Observations in the Panel Study of Income Dynamics  - An Exploration of Gender Gap in Labor Market; Money Resource Allocation to Children in PSID
  • Christopher Worsham, Statistics - A Stochastic Model of Retinal Development in Zebrafish

Faculty Supervising Undergraduate Research

• Danny Almiral l supervises undergraduate researchers with an interest in applied issues in causal inference, dynamic treatment regimens and sequential multiple assignment randomized trials (SMART). Projects include: o Topics in design and analysis of clinical trials for adaptive treatment plans, by Hwanwoo Kim. Co-advised with Ed Ionides. 2nd prize winner in the national Undergraduate Statistics Project Competition. o Adaptive intervention designs in substance use prevention. o An Investigation of Predictor for Tailoring Ecological momentary Assessment and Contextual Recall. o Introduction to Sequential Multiple Assignment Randomized Trials (SMARTs) with Zero Inflated Count Outcomes for the Development of Dynamic Treatment Regimens (DTRs): with application to substance use research.

If you are interested in working with Dr. Almirall, please visit his web page first to see if he is currently accepting new students: http://www-personal.umich.edu/~dalmiral/ .

• Moulinath Banerjee has supervised undergraduate projects including: o Detecting Active Pathways in Gene Sets.

• Ben Hansen has supervised undergraduate projects including: o Proposals for Generating and Utilizing Well Informed Initialization Values to Improve the Computational Efficiency of Optmatch.

• Xuming He supervises UROP students and advanced undergraduate research in a broad area of statistics. Examples include: o Monte Carlo evaluation of Value-at-Risk. o Ordering of multivariate Data.

• Al Hero has supervised undergraduate projects including: o Dynamic distributed multidimensional scaling (MDS) for data visualization. o Spatio-temporal network anomaly detection in Abilene data streams. o Canonical correlation analysis for sunspot and coronal mass ejection image representation.

• Tailen Hsing has supervised undergraduate projects including: o Analyzing Argo Data Co-advised with Stilian Stoev o Argo Data Mean Field Modeling Co-advised with Stilian Stoev

• Ed Ionides has supervised undergraduate projects including: o Topics in design and analysis of clinical trials for adaptive treatment plans. Co-advised with Danny Almirall. 2nd prize winner in the national Undergraduate Statistics Project Competition. o Modeling cholera as a stochastic process. o Building POMP objects in R for a dynamic general stochastic equilibrium model.. o Investigating sequential Monte Carlo methods for time series analysis. o Identification of insurance companies at risk of insolvency. Co-advised with Kristen Moore.

• Long Nguyen has supervised undergraduate projects including: o Traffic Flow and Density Analysis of NYC TLC Taxi Data. o NYC Optimal Transport and Ridesharing.

• Kerby Shedden supervises undergraduate research with an emphasis on bioinformatics. Examples include: o Statistical analysis of high frequency motion capture and muscle activity data: applications to assessing development of trunk postural control. o Sparsity in the distribution of correlation coefficients in molecular screening data. Co-advised with Ji Zhu. o Individual-specific and disease-specific factors in acquired copy number variations in cancer. o Detection of DNA lesions in acute myelogenous leukemia. o Two-tiered false discovery rates. o Selective targeting of stem-cell-like cancer cell lines. Co-advised with Gus Rosania.

• Ambuj Tewari has supervised undergraduate research projects and an honors theses. Former projects include: o Development of an Android app for mobile health. o Simulations comparing bandit algorithms. o Development of HeartSteps, an Android app for encouraging physical activity. Co-advised with Predrag Klasnja o Empirical evaluation of online learning algorithms (honors thesis). o Numerical experiments with Lasso in high dimensional VAR models.

• Ji Zhu has supervised undergraduate research projects and honors theses. Projects include: o Forecasting Stock Returns in the Chinese Market with Convolutional Neural Networks. o Medical Image Classification Building Upon Pre-trained Neural Networks: An Application on Diabetic Retinopathy Detection.

Undergraduate Research Opportunity Program (UROP)

UROP is a great way to get an introduction to research during the first two years at University of Michigan. See the  UROP website  for more information. For the most part, Statistics and Data Science research projects require foundational preparation in statistics, mathematics and computer programming. Sometimes, first year students have sufficient preparation through AP courses and other experiences. Otherwise, it may be appropriate to take introductory statistics, computer programming and calculus courses in the first year to be ready for a second year UROP project.

Other Opportunities for Undergraduate Research

It is possible to conduct undergraduate research that does not fall into either the honors program or UROP. If you find yourself interested in the research agenda of a Statistics faculty member, you can email to enquire about available options. This research can be carried out as part of Stats 489 [Independent Study in Statistics], as a paid position if one is available, or as an informal arrangement for neither course credit nor payment. Arrangements must be made on a case-by-case basis with the potential faculty superviser.

LSA - College of Literature, Science, and The Arts - University of Michigan

  • Information For
  • Prospective Students
  • Current Students
  • Faculty and Staff
  • More about LSA
  • How Do I Apply?
  • LSA Magazine
  • Student Resources
  • Academic Advising
  • Global Studies
  • LSA Opportunity Hub
  • Social Media
  • Update Contact Info
  • Privacy Statement
  • Report Feedback

research data analysis michigan

Still accepting applications for online and hybrid programs!

  • Skip to content
  • Skip to search
  • Accessibility Policy
  • Report an Accessibility Issue

Logo for the School of Public Health

Clinical Research Design and Statistical Analysis

The Clinical Research Design and Statistical Analysis program (OJOC CRDSA)  is now accepting  applications for the cohort starting in Fall 2022.  The admission is on a rolling basis.

Future Program Dates: October 2022–April 2024

Offered by the Department of Biostatistics , the On-Job/On-Campus Master's in Clinical Research Design and Statistical Analysis (CRDSA) Program was developed in a non-residential format to provide a means for working professionals who are interested in clinical research to develop expertise in research design and statistical analysis while continuing their professional employment. The program is open to a broad spectrum of candidates such as physicians, dentists, pharmacists, pharmacologists, basic scientists, study coordinators, managers and others who are interested in pursuing a career in Clinical Research and Clinical Trials.

The CRDSA program is designed to improve the quality of clinical research and to address the shortage of persons with clinical expertise who are trained in quantitative research methods. These problems reflect the increasing complexity of clinical research, the increasing value of that clinical research, and the limited training of health professionals in research design and statistical analysis. Most clinical training programs include only minimal introductory statistics and research design activities. This CRDSA program is designed specifically to teach the skills and knowledge needed to carry out quantitative clinical research.

Non-Residential Format

For working professionals.

Health care professoinals develop expertise in research design and statistical analysis while continuing their professional employment.

Apply Today

SCHEDULED FOR WORKING PROFESSIONALS

The U-M School of Public Health pioneered the non-residential OJ/OC format in 1972. Over 500 students have successfully completed the CRDSA program since then. Participants meet in Ann Arbor at the School of Public Health for a four-day weekend (Thursday, Friday, Saturday, and Sunday) once every four to five weeks for thirty hours of class time. Between weekends on campus, participants complete course assignments and work on projects while remaining on their jobs. This Master of Science Degree Program lasts nineteen weekends.

INTEGRATION WITH EMPLOYMENT

Each student's work setting becomes a personal research laboratory; and many of the program's assignments are created with the work setting in mind. Participants design and implement research projects, review proposals, and critique literature in their fields. The integration of work and academics increases the effectiveness of the program by making it part of, rather than isolated from, practice.

PROGRAM CONTENT

The content of the program can be defined in a number of ways: the purposes of research, research design concepts, data collection methods, and statistical or analytical methods. The program provides concepts and methods that relate to the purposes of clinical research, clinical epidemiology, clinical trials, program evaluation, and technology assessment. Research design concepts include the traditional approaches to the scientific method: the concepts of validity, reliability, causal relationships, the role of randomization, standards for comparison, and sampling, as well as other recently developed methods of approaching decisions about research outcomes such as decision analysis and cost-utility analysis. The data collection methods deal with instrumentation, questionnaire construction, nonreactive measures, survey techniques, qualitative data, measurement and standardization problems, concepts and criteria of normalcy, and disease and diagnostic criteria. Statistical techniques for estimation and hypothesis testing are presented, including comparison of proportions, chi-square test, comparison of means, analysis of variance and covariance, multiple regression analysis, logistic regression, and survival analysis. In addition to a comprehensive curriculum in research design and statistical analysis, other content relevant to clinical researchers includes: ethical and legal issues in clinical research, technical writing skills and proposal/report writing, management of research, and behavioral factors in clinical research. Students learn computer skills and concepts, including data file management, data organization, and use of statistical packages. Visiting faculty with experience in specialized research subjects meet with the students to discuss current problems in clinical research.

The MS in CRDSA is administered through the Horace H. Rackham School of Graduate Studies. To apply for the CRDSA program please fill out a Rackham Graduate School application . All applicants should document their current or potential involvement in research including clinical studies, clinical research, clinical epidemiology, or clinical trials and quantitative skills if available.

Application Procedure

The applicants to the CRDSA program must submit the following supplemental materials (online) in addition to a complete Rackham Application and non-refundable application fee:

  • Official transcripts of all college-level education (upload copy and/or mail original with application).
  • Three letters of recommendation (your recommenders submit them online).
  • Statement of Purpose (submit online with application).
  • Personal Statement (submit online with application).
  • Curriculum Vitae (submit online with application).
  • International students whose first language is not English must also submit TOEFL or MELAB scores

Standardized test scores are not required for applicants applying to any master- or doctoral-level programs for the fall 2022 academic year.

Questions regarding admissions or the application process may be directed to:

Telephone : 734-615-9812 Fax: 734-763-2215 E-mail: [email protected]

Return to top

Course Descriptions

Foundations of public health practice.

This module was designed to address 12 foundational learning objectives of public health. Completion of this module and exam are part of the curricular requirements for MPH, MHSA, MHI, MS, and PhD students at Michigan Public Health. The module is divided into four parts - you will hear from UM faculty and practitioners from the region about the core philosophies of public health, the importance of evidence-based practice, the various factors that influence health, and the necessity of taking an ecological perspective to population health.

BIOSTAT 511: Computer Packages

An introduction to statistical computer packages in both network and microcomputer environments. Data organization and file management will also be discussed.

BIOSTAT 517: Survey Sampling for Clinical Research

The main sampling methods used for surveys in clinical research are discussed, including: probability sampling; simple random sampling; stratified sampling; systematic sampling; multi-stage sampling; sampling with probability proportional to size; cost factors; sampling errors; non-response; sampling frame problems; non-sampling errors; practical designs and procedures.

BIOSTAT 523: Statistical Methods for Epidemiology

Statistical methods commonly used in clinical research, with an emphasis on choosing appropriate procedures and subsequent interpretation. Topics covered: 2 x 2 tables, Mantel-Haenszel, tests for trend in risk, methods for matched designs, logistic regression, and Cox models.

BIOSTAT 524: Biostatistics for Clinical Researchers

Basic probability theory and statistical methods used by biostatisticians. These include design of experiments, point and interval estimation, and hypothesis testing. New topics include simple and multiple regression methods, and analysis of variance and covariance.

BIOSTAT 525: Multiple Linear Regression Including Anova

This course introduces linear regression methodology for continuous outcomes using multiple predictor variables. The course teaches how to interpret association between outcomes and explanatory variables, techniques for building predictive models, and methodologies for model diagnostics; all under the assumption of a normally distributed outcome. R and SAS will be used.

BIOSTAT 526: Topics In Biostatistics

The course consists of three modular topics: (1) Introduction Bayesian statistics and its applications in clinical research; (2) Strategies for dealing with missing data in outcomes and covariates in clinical research; and (3) Strategies for the analysis of “Omics” data in clinical research. Basic concepts and applications will be discussed through case studies.

BIOSTAT 558: Clinical Trials and Study Design

This course is designed for individuals interested in the scientific, policy, and management aspects of clinical trials, with emphasis on scientifically rigorous trial design. Topics include types of clinical research, study design, treatment allocation, randomization and stratification, quality control, sample size requirements, patient consent, and interpretation of results. This course will additionally cover strengths and limitations of alternative study designs such as quasi-experiments and observational studies. Common sources of bias in these alternative study designs will be described along with design approaches to minimize bias.

BIOSTAT 581: Biostatistical Modeling in Clinical Research

This is a course in statistical modeling, with an emphasis on models for correlated data that arise when subjects are repeatedly measured or are clustered. These models, called mixed models, are extensions of linear and nonlinear regression and analysis of variance. Examples will be drawn from clinical studies, such as multi-arm biomarker studies and crossover trials. Analyses of population pharmacokinetics and longitudinal data will also be discussed. Hands-on data analysis and presentation using standard computer software for linear and nonlinear analysis will be emphasized. Course goals include the ability to formulate and evaluate a model, to read the scientific literature that employs these models, to interact fruitfully with data modeling specialists, and to present the results of these models mathematically and graphically.

BIOSTAT 590: Statistical Analysis and Presentation of Research Topics

This course is intended to integrate and apply biostatistical and epidemiologic methods presented in other OJ/OC courses to clinical research data. Students will identify the scientific objectives of a clinical research study and develop a statistical analysis strategy appropriate for those objectives; plan strategies for statistical design and analysis and implement these strategies; learn to be aware of problems that arise in data collection; learn to communicate through presentation of oral and written reports and through student and faculty critiques of these reports; learn to communicate results of clinical research projects in clear, accurate, concise language; learn appropriate writing styles and formats for clinical research articles, and apply writing skills to research papers.

BIOSTAT 599: Planning and Funding Clinical Research

This course will encompass four main areas of exploration. The preparation of a written document whose focus is on an integrated research plan including specific aims, background and significance, design, methods, logistical implementation and statistical analysis, and fiscal requirements. The evaluation of clinical research plans. Identification of funding sources and their requirements. And identification of the role of the research administrator in facilitating clinical research.

EPID 601 : Methods of Epidemiology: Measure and Measurement

An overview and introduction to the measure of association used in epidemiologic studies, as well as a description of the nature and characteristics of major epidemiologic study designs.

CRDSA Faculty

Mousumi Banerjee

Samer Al Hadidi, MD MS FACP - CRDSA Class of 2017

READ MORE TESTIMONIALS HERE

Information

E-mail : [email protected] Telephone : 734-615-9812

Mail : Department of Biostatistics OJ/OC Program in CRDSA School of Public Health University of Michigan 1415 Washington Heights Ann Arbor, MI 48109-2029

Fax : 734-763-2215

Information For

  • Prospective Students
  • Current Students
  • Alumni and Donors
  • Community Partners and Employers
  • About Public Health
  • How Do I Apply?
  • Departments
  • Findings magazine

Student Resources

  • Career Development
  • Certificates
  • Internships
  • The Heights Intranet
  • Update Contact Info
  • Report Website Feedback

research data analysis michigan

Michigan Research Cores

Data Tools and Analysis

Find a core facility to help with your research.

Search below to explore services, equipment, locations, and more

research data analysis michigan

Applied Biostatics Laboratory & Data Management Core

The goal for the Applied Biostatistics Laboratory is to further advance the methodological and statistical rigor of research projects

Dr. Robert Ploutz-Snyder (734)647-0462 [email protected]

research data analysis michigan

Biological Mass Spectrometry Facility

Analyzing biological and large macromolecular samples using mass spectrometry.

Carmen Dunbar (734) 647-2878 [email protected]

research data analysis michigan

BRCF Bioinformatics Core

assists with the experimental design, workflow development, and analysis of next-generation sequencing data.

Bioinformatics Core 734-998-9249 [email protected]

research data analysis michigan

Cardiovascular Regeneration Core Laboratory

generates patient-specific human induced pluripotent stem cells to study cardiovascular diseases.

Todd Herron, PhD 734-998-0460 [email protected]

research data analysis michigan

Center for Chemical Genomics (CCG)

provides high-throughput screening of extensive small molecule, natural product and siRNA libraries.

Andrew Alt, PhD 734-763-2340 [email protected]

research data analysis michigan

Center for Molecular Imaging (CMI) & Preclinical Molecular Imaging (PMI)

provide instrumentation and expertise in non-invasive animal imaging.

Amanda Fair 734-615-3009 [email protected]

research data analysis michigan

Data & Design Core, Eisenberg Family Depression Center

Offers statistical support and guidance for leveraging secondary data resources for mental health research

Meghan Seewald [email protected]

research data analysis michigan

Data Office for Clinical & Translational Research (DOCTR)

enables secure access to patient data through investigator-friendly, self-serve tools and custom data extracts.

Data Office 734-615-2100 [email protected]

research data analysis michigan

Data Science, Biostatistics, and Informatics Core

provides experimental design, data management, health analytics, mathematical modeling, and statistical computing.

Ivo Dinov 734-764-1111 [email protected]

research data analysis michigan

Electron Microbeam Analysis Lab (EMAL)

provides instrumentation for the chemical and structural characterization of solid materials.

Owen Neill, PhD 734-615-6657 [email protected]

research data analysis michigan

Human Research 3 Tesla Magnetic Resonance Imaging (Research 3T)

provides anatomic and advanced MR imaging services for IRB-approved, sponsor-funded studies and clinical trials.

Thomas L. Chenevert, PhD 734-936-8866 [email protected]

research data analysis michigan

Institute for Healthcare and Policy Innovation (IHPI)

access to 20 terabytes of data, from more than 113 million Americans, for researchers to study how healthcare works.

Patrick Brady, MHA 734-763-4335 [email protected]

research data analysis michigan

Inter-University Consortium for Political and Social Research Virtual Data Enclave (ICPSR)

maintains a data archive of more than 250,000 files of research in the social and behavioral sciences.

Stuart Hutchings 734-615-7362 [email protected]

Kidney Epidemiology and Cost Center (KECC)

manages and maintains a secure HIPAA and FISMA enclave and provides Expert Consultation as well as Staff Effort

Kidney Epidemiology and Cost Center (734) 763-6611 [email protected]

research data analysis michigan

Library Visualization Services

provides free support for data visualization across all disciplines.

Library Visualization Services [email protected]

research data analysis michigan

Michigan Center for Materials Characterization (MC)2

is dedicated to the micron and nanoscale imaging and analysis of materials.

Bobby Kerns 734-647-6845 [email protected]

research data analysis michigan

Michigan Flora Online

presents the basic information about all vascular plants known to occur outside of cultivation in the state.

Anton A. Reznicek 734-764-5544 [email protected]

research data analysis michigan

Michigan Institute for Clinical & Health Research (MICHR)

educates, funds, connects & supports clinical and translational research teams.

MICHR (734) 998-7474 [email protected]

research data analysis michigan

Michigan Integrative Musculoskeletal Health Core Center (MiMHC)

Our 3 Cores provide analyses aimed at understanding musculoskeletal health using paraffin and plastic (hard tissue) histology and training, micro/nanoCT imaging, Raman spectrometry, whole animal/tissue level testing, Omics and machine learning support.

Karl Jepsen, PhD [email protected]

research data analysis michigan

Micro & Nano Computed Tomography Advanced Imaging Core

provides 3-D imaging and quantitative analysis of structures/materials, including metals, silica-based chips and plastics.

Andrea Clark 734-615-6956 [email protected]

Total Cores: 29

Crash and Driving Data Analysis

UMTRI experts have access to petabytes of efficient and high-quality naturalistic data sets, crash data for the U.S. and several states, as well as the largest set of connected vehicle data in the world.  This data will better inform manufacturers, and policy makers in their pursuit of a national connected vehicle network and AV deployment. 

The Center for the Management of Information for Safe and Sustainable Transportation (CMISST) Group gathers, combines, and analyzes all types of transportation datasets to answer pressing questions in transportation safety and efficiency.

CMISST has one of the largest, most efficient and high-quality naturalistic sets of data around.  In fact, they have the largest set of connected vehicle data in the world.  Combined with our state-of-the-art data management systems and research expertise, CMISST experts are leading the way to a better understanding of factors that impact transportation safety and mobility, and identify the most effective crash and injury countermeasures.

Selected Publications

  • Bálint A, Flannagan CA, Leslie A, Klauer S, Guo F, & Dozza M. (2020).  Multitasking additional-to-driving : Prevalence, structure, and associated risk in SHRP2 naturalistic driving data. Accident Analysis and Prevention 137 .
  • Benedetti M, Klinich KD, Manary MA, Flannagan CA (2017) Predictors of restraint use among child occupants . Traffic Injury Prevention 18(8):866-860.
  • Benedetti M, Klinich KD, Manary MA, Flannagan CA. (2019) Factors affecting child injury risk in motor-vehicle crashes. Stapp Car Crash Journal 63.
  • Blower D, Flannagan C, Geedipally S, Lord D, & Wunderlich R. (2019)  Identification of Factors Contributing to the Decline of Traffic Fatalities in the United States from 2008 to 2012 . National Academies Press.
  • Buckley L, Bingham C R, Flannagan C A, Carter P M, Almani F, and Cicchino J B (2016) Observation of motorcycle helmet use rates in Michigan after partial repeal of the universal motorcycle helmet law .  Accident Analysis & Prevention ,  95 , 178-186.
  • Carter PM, Buckley L, Flannagan CA, Cicchino JB, Hemmila M, Bowman PJ, and Bingham CR (2017) The impact of Michigan’s partial repeal of the universal motorcycle helmet law on helmet use, fatalities, and head injuries .  American Journal of Public Health  107(1):166-172.
  • Flannagan C, Selpi, Boyraz P, Leslie A, Kovaceva J, & Thomson R. (Mar 2019) Analysis of SHRP2 Data to Understand Normal and Abnormal Driving Behavior in Work Zones: Final Report. Federal Highway Administration. FHWA-HRT-20-010.   https://www.fhwa.dot.gov/
  • Flannagan CA & Leslie A. (2020).  Crash Avoidance Technology Evaluation Using Real-World Crash Data  (No. DOT HS 812 841). United States. Department of Transportation. National Highway Traffic Safety Administration.
  • FlannaganCA, Bálint A, Klinich KD, Sander U, ManaryMA, CunyS, McCarthy M, Phan V, Wallbank C, Green PE, Sui B, Forsman A, Fagerlind H. (2018) Comparing motor-vehicle crash risk of EU and US vehicles, Accident Analysis and Prevention 117:392-397. DOI:10:1016/j.aap.2018.01.003
  • Klinich KD, Bowman P, Flannagan CA, Rupp JD (2016) Injury Patterns in Motor-Vehicle Crashes in the United States 1998-2014 , UMTRI 2016-06, University of Michigan Transportation Research Institute.
  • Kostyniuk LP, St. Louis RM, Zakrajsek J, Stanciu S, Zanier N, and Molnar LJ. (2017) Societal Costs of Traffic Crashes and Crime in Michigan: 2017 Update (UMTRI-2017-01). Ann Arbor, MI: University of Michigan Transportation Research Institute.
  • Newnam S, Blower D, Molnar LJ, Eby DW, and Koppel S. (2018) Exploring crash characteristics and injury outcomes: an analysis of truck-involved crash data in the US . Safety Science 106:140-145.
  • Rafei A, Flannagan CA, & Elliott MR. (2020) Big Data for Finite Population Inference: Applying Quasi-Random Approaches to Naturalistic Driving Data Using Bayesian Additive Regression Trees . Journal of Survey Statistics and Methodology 8(1) : 148–180.
  • Tan YV, Flannagan CA, and Elliott MR (2019) “Robust-squared” imputation models using BART . Journal of Survey Statistics and Methodology
  • Wu J, Shan C, Chou CC, Hu J, Cao L, Jiang B (2018) Age effects on injury pattern of rear-seat child occupants in frontal crashes . International Journal of Vehicle Safety , 10(3-4): 288-300.
  • Yu B, Bao S, Chen Y (2019) Quantifying Visual Road Environment to Establish a Speeding Prediction Model: An Examination Using Naturalistic Driving Data . Accident Analysis & Prevention 129: 289-298.

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • View all journals
  • My Account Login
  • Explore content
  • About the journal
  • Publish with us
  • Sign up for alerts
  • Data Descriptor
  • Open access
  • Published: 03 May 2024

A dataset for measuring the impact of research data and their curation

  • Libby Hemphill   ORCID: orcid.org/0000-0002-3793-7281 1 , 2 ,
  • Andrea Thomer 3 ,
  • Sara Lafia 1 ,
  • Lizhou Fan 2 ,
  • David Bleckley   ORCID: orcid.org/0000-0001-7715-4348 1 &
  • Elizabeth Moss 1  

Scientific Data volume  11 , Article number:  442 ( 2024 ) Cite this article

686 Accesses

8 Altmetric

Metrics details

  • Research data
  • Social sciences

Science funders, publishers, and data archives make decisions about how to responsibly allocate resources to maximize the reuse potential of research data. This paper introduces a dataset developed to measure the impact of archival and data curation decisions on data reuse. The dataset describes 10,605 social science research datasets, their curation histories, and reuse contexts in 94,755 publications that cover 59 years from 1963 to 2022. The dataset was constructed from study-level metadata, citing publications, and curation records available through the Inter-university Consortium for Political and Social Research (ICPSR) at the University of Michigan. The dataset includes information about study-level attributes (e.g., PIs, funders, subject terms); usage statistics (e.g., downloads, citations); archiving decisions (e.g., curation activities, data transformations); and bibliometric attributes (e.g., journals, authors) for citing publications. This dataset provides information on factors that contribute to long-term data reuse, which can inform the design of effective evidence-based recommendations to support high-impact research data curation decisions.

Similar content being viewed by others

research data analysis michigan

SciSciNet: A large-scale open data lake for the science of science research

research data analysis michigan

Data, measurement and empirical methods in the science of science

research data analysis michigan

Interdisciplinarity revisited: evidence for research impact and dynamism

Background & summary.

Recent policy changes in funding agencies and academic journals have increased data sharing among researchers and between researchers and the public. Data sharing advances science and provides the transparency necessary for evaluating, replicating, and verifying results. However, many data-sharing policies do not explain what constitutes an appropriate dataset for archiving or how to determine the value of datasets to secondary users 1 , 2 , 3 . Questions about how to allocate data-sharing resources efficiently and responsibly have gone unanswered 4 , 5 , 6 . For instance, data-sharing policies recognize that not all data should be curated and preserved, but they do not articulate metrics or guidelines for determining what data are most worthy of investment.

Despite the potential for innovation and advancement that data sharing holds, the best strategies to prioritize datasets for preparation and archiving are often unclear. Some datasets are likely to have more downstream potential than others, and data curation policies and workflows should prioritize high-value data instead of being one-size-fits-all. Though prior research in library and information science has shown that the “analytic potential” of a dataset is key to its reuse value 7 , work is needed to implement conceptual data reuse frameworks 8 , 9 , 10 , 11 , 12 , 13 , 14 . In addition, publishers and data archives need guidance to develop metrics and evaluation strategies to assess the impact of datasets.

Several existing resources have been compiled to study the relationship between the reuse of scholarly products, such as datasets (Table  1 ); however, none of these resources include explicit information on how curation processes are applied to data to increase their value, maximize their accessibility, and ensure their long-term preservation. The CCex (Curation Costs Exchange) provides models of curation services along with cost-related datasets shared by contributors but does not make explicit connections between them or include reuse information 15 . Analyses on platforms such as DataCite 16 have focused on metadata completeness and record usage, but have not included related curation-level information. Analyses of GenBank 17 and FigShare 18 , 19 citation networks do not include curation information. Related studies of Github repository reuse 20 and Softcite software citation 21 reveal significant factors that impact the reuse of secondary research products but do not focus on research data. RD-Switchboard 22 and DSKG 23 are scholarly knowledge graphs linking research data to articles, patents, and grants, but largely omit social science research data and do not include curation-level factors. To our knowledge, other studies of curation work in organizations similar to ICPSR – such as GESIS 24 , Dataverse 25 , and DANS 26 – have not made their underlying data available for analysis.

This paper describes a dataset 27 compiled for the MICA project (Measuring the Impact of Curation Actions) led by investigators at ICPSR, a large social science data archive at the University of Michigan. The dataset was originally developed to study the impacts of data curation and archiving on data reuse. The MICA dataset has supported several previous publications investigating the intensity of data curation actions 28 , the relationship between data curation actions and data reuse 29 , and the structures of research communities in a data citation network 30 . Collectively, these studies help explain the return on various types of curatorial investments. The dataset that we introduce in this paper, which we refer to as the MICA dataset, has the potential to address research questions in the areas of science (e.g., knowledge production), library and information science (e.g., scholarly communication), and data archiving (e.g., reproducible workflows).

We constructed the MICA dataset 27 using records available at ICPSR, a large social science data archive at the University of Michigan. Data set creation involved: collecting and enriching metadata for articles indexed in the ICPSR Bibliography of Data-related Literature against the Dimensions AI bibliometric database; gathering usage statistics for studies from ICPSR’s administrative database; processing data curation work logs from ICPSR’s project tracking platform, Jira; and linking data in social science studies and series to citing analysis papers (Fig.  1 ).

figure 1

Steps to prepare MICA dataset for analysis - external sources are red, primary internal sources are blue, and internal linked sources are green.

Enrich paper metadata

The ICPSR Bibliography of Data-related Literature is a growing database of literature in which data from ICPSR studies have been used. Its creation was funded by the National Science Foundation (Award 9977984), and for the past 20 years it has been supported by ICPSR membership and multiple US federally-funded and foundation-funded topical archives at ICPSR. The Bibliography was originally launched in the year 2000 to aid in data discovery by providing a searchable database linking publications to the study data used in them. The Bibliography collects the universe of output based on the data shared in each study through, which is made available through each ICPSR study’s webpage. The Bibliography contains both peer-reviewed and grey literature, which provides evidence for measuring the impact of research data. For an item to be included in the ICPSR Bibliography, it must contain an analysis of data archived by ICPSR or contain a discussion or critique of the data collection process, study design, or methodology 31 . The Bibliography is manually curated by a team of librarians and information specialists at ICPSR who enter and validate entries. Some publications are supplied to the Bibliography by data depositors, and some citations are submitted to the Bibliography by authors who abide by ICPSR’s terms of use requiring them to submit citations to works in which they analyzed data retrieved from ICPSR. Most of the Bibliography is populated by Bibliography team members, who create custom queries for ICPSR studies performed across numerous sources, including Google Scholar, ProQuest, SSRN, and others. Each record in the Bibliography is one publication that has used one or more ICPSR studies. The version we used was captured on 2021-11-16 and included 94,755 publications.

To expand the coverage of the ICPSR Bibliography, we searched exhaustively for all ICPSR study names, unique numbers assigned to ICPSR studies, and DOIs 32 using a full-text index available through the Dimensions AI database 33 . We accessed Dimensions through a license agreement with the University of Michigan. ICPSR Bibliography librarians and information specialists manually reviewed and validated new entries that matched one or more search criteria. We then used Dimensions to gather enriched metadata and full-text links for items in the Bibliography with DOIs. We matched 43% of the items in the Bibliography to enriched Dimensions metadata including abstracts, field of research codes, concepts, and authors’ institutional information; we also obtained links to full text for 16% of Bibliography items. Based on licensing agreements, we included Dimensions identifiers and links to full text so that users with valid publisher and database access can construct an enriched publication dataset.

Gather study usage data

ICPSR maintains a relational administrative database, DBInfo, that organizes study-level metadata and information on data reuse across separate tables. Studies at ICPSR consist of one or more files collected at a single time or for a single purpose; studies in which the same variables are observed over time are grouped into series. Each study at ICPSR is assigned a DOI, and its metadata are stored in DBInfo. Study metadata follows the Data Documentation Initiative (DDI) Codebook 2.5 standard. DDI elements included in our dataset are title, ICPSR study identification number, DOI, authoring entities, description (abstract), funding agencies, subject terms assigned to the study during curation, and geographic coverage. We also created variables based on DDI elements: total variable count, the presence of survey question text in the metadata, the number of author entities, and whether an author entity was an institution. We gathered metadata for ICPSR’s 10,605 unrestricted public-use studies available as of 2021-11-16 ( https://www.icpsr.umich.edu/web/pages/membership/or/metadata/oai.html ).

To link study usage data with study-level metadata records, we joined study metadata from DBinfo on study usage information, which included total study downloads (data and documentation), individual data file downloads, and cumulative citations from the ICPSR Bibliography. We also gathered descriptive metadata for each study and its variables, which allowed us to summarize and append recoded fields onto the study-level metadata such as curation level, number and type of principle investigators, total variable count, and binary variables indicating whether the study data were made available for online analysis, whether survey question text was made searchable online, and whether the study variables were indexed for search. These characteristics describe aspects of the discoverability of the data to compare with other characteristics of the study. We used the study and series numbers included in the ICPSR Bibliography as unique identifiers to link papers to metadata and analyze the community structure of dataset co-citations in the ICPSR Bibliography 32 .

Process curation work logs

Researchers deposit data at ICPSR for curation and long-term preservation. Between 2016 and 2020, more than 3,000 research studies were deposited with ICPSR. Since 2017, ICPSR has organized curation work into a central unit that provides varied levels of curation that vary in the intensity and complexity of data enhancement that they provide. While the levels of curation are standardized as to effort (level one = less effort, level three = most effort), the specific curatorial actions undertaken for each dataset vary. The specific curation actions are captured in Jira, a work tracking program, which data curators at ICPSR use to collaborate and communicate their progress through tickets. We obtained access to a corpus of 669 completed Jira tickets corresponding to the curation of 566 unique studies between February 2017 and December 2019 28 .

To process the tickets, we focused only on their work log portions, which contained free text descriptions of work that data curators had performed on a deposited study, along with the curators’ identifiers, and timestamps. To protect the confidentiality of the data curators and the processing steps they performed, we collaborated with ICPSR’s curation unit to propose a classification scheme, which we used to train a Naive Bayes classifier and label curation actions in each work log sentence. The eight curation action labels we proposed 28 were: (1) initial review and planning, (2) data transformation, (3) metadata, (4) documentation, (5) quality checks, (6) communication, (7) other, and (8) non-curation work. We note that these categories of curation work are very specific to the curatorial processes and types of data stored at ICPSR, and may not match the curation activities at other repositories. After applying the classifier to the work log sentences, we obtained summary-level curation actions for a subset of all ICPSR studies (5%), along with the total number of hours spent on data curation for each study, and the proportion of time associated with each action during curation.

Data Records

The MICA dataset 27 connects records for each of ICPSR’s archived research studies to the research publications that use them and related curation activities available for a subset of studies (Fig.  2 ). Each of the three tables published in the dataset is available as a study archived at ICPSR. The data tables are distributed as statistical files available for use in SAS, SPSS, Stata, and R as well as delimited and ASCII text files. The dataset is organized around studies and papers as primary entities. The studies table lists ICPSR studies, their metadata attributes, and usage information; the papers table was constructed using the ICPSR Bibliography and Dimensions database; and the curation logs table summarizes the data curation steps performed on a subset of ICPSR studies.

Studies (“ICPSR_STUDIES”): 10,605 social science research datasets available through ICPSR up to 2021-11-16 with variables for ICPSR study number, digital object identifier, study name, series number, series title, authoring entities, full-text description, release date, funding agency, geographic coverage, subject terms, topical archive, curation level, single principal investigator (PI), institutional PI, the total number of PIs, total variables in data files, question text availability, study variable indexing, level of restriction, total unique users downloading study data files and codebooks, total unique users downloading data only, and total unique papers citing data through November 2021. Studies map to the papers and curation logs table through ICPSR study numbers as “STUDY”. However, not every study in this table will have records in the papers and curation logs tables.

Papers (“ICPSR_PAPERS”): 94,755 publications collected from 2000-08-11 to 2021-11-16 in the ICPSR Bibliography and enriched with metadata from the Dimensions database with variables for paper number, identifier, title, authors, publication venue, item type, publication date, input date, ICPSR series numbers used in the paper, ICPSR study numbers used in the paper, the Dimension identifier, and the Dimensions link to the publication’s full text. Papers map to the studies table through ICPSR study numbers in the “STUDY_NUMS” field. Each record represents a single publication, and because a researcher can use multiple datasets when creating a publication, each record may list multiple studies or series.

Curation logs (“ICPSR_CURATION_LOGS”): 649 curation logs for 563 ICPSR studies (although most studies in the subset had one curation log, some studies were associated with multiple logs, with a maximum of 10) curated between February 2017 and December 2019 with variables for study number, action labels assigned to work description sentences using a classifier trained on ICPSR curation logs, hours of work associated with a single log entry, and total hours of work logged for the curation ticket. Curation logs map to the study and paper tables through ICPSR study numbers as “STUDY”. Each record represents a single logged action, and future users may wish to aggregate actions to the study level before joining tables.

figure 2

Entity-relation diagram.

Technical Validation

We report on the reliability of the dataset’s metadata in the following subsections. To support future reuse of the dataset, curation services provided through ICPSR improved data quality by checking for missing values, adding variable labels, and creating a codebook.

All 10,605 studies available through ICPSR have a DOI and a full-text description summarizing what the study is about, the purpose of the study, the main topics covered, and the questions the PIs attempted to answer when they conducted the study. Personal names (i.e., principal investigators) and organizational names (i.e., funding agencies) are standardized against an authority list maintained by ICPSR; geographic names and subject terms are also standardized and hierarchically indexed in the ICPSR Thesaurus 34 . Many of ICPSR’s studies (63%) are in a series and are distributed through the ICPSR General Archive (56%), a non-topical archive that accepts any social or behavioral science data. While study data have been available through ICPSR since 1962, the earliest digital release date recorded for a study was 1984-03-18, when ICPSR’s database was first employed, and the most recent date is 2021-10-28 when the dataset was collected.

Curation level information was recorded starting in 2017 and is available for 1,125 studies (11%); approximately 80% of studies with assigned curation levels received curation services, equally distributed between Levels 1 (least intensive), 2 (moderately intensive), and 3 (most intensive) (Fig.  3 ). Detailed descriptions of ICPSR’s curation levels are available online 35 . Additional metadata are available for a subset of 421 studies (4%), including information about whether the study has a single PI, an institutional PI, the total number of PIs involved, total variables recorded is available for online analysis, has searchable question text, has variables that are indexed for search, contains one or more restricted files, and whether the study is completely restricted. We provided additional metadata for this subset of ICPSR studies because they were released within the past five years and detailed curation and usage information were available for them. Usage statistics including total downloads and data file downloads are available for this subset of studies as well; citation statistics are available for 8,030 studies (76%). Most ICPSR studies have fewer than 500 users, as indicated by total downloads, or citations (Fig.  4 ).

figure 3

ICPSR study curation levels.

figure 4

ICPSR study usage.

A subset of 43,102 publications (45%) available in the ICPSR Bibliography had a DOI. Author metadata were entered as free text, meaning that variations may exist and require additional normalization and pre-processing prior to analysis. While author information is standardized for each publication, individual names may appear in different sort orders (e.g., “Earls, Felton J.” and “Stephen W. Raudenbush”). Most of the items in the ICPSR Bibliography as of 2021-11-16 were journal articles (59%), reports (14%), conference presentations (9%), or theses (8%) (Fig.  5 ). The number of publications collected in the Bibliography has increased each decade since the inception of ICPSR in 1962 (Fig.  6 ). Most ICPSR studies (76%) have one or more citations in a publication.

figure 5

ICPSR Bibliography citation types.

figure 6

ICPSR citations by decade.

Usage Notes

The dataset consists of three tables that can be joined using the “STUDY” key as shown in Fig.  2 . The “ICPSR_PAPERS” table contains one row per paper with one or more cited studies in the “STUDY_NUMS” column. We manipulated and analyzed the tables as CSV files with the Pandas library 36 in Python and the Tidyverse packages 37 in R.

The present MICA dataset can be used independently to study the relationship between curation decisions and data reuse. Evidence of reuse for specific studies is available in several forms: usage information, including downloads and citation counts; and citation contexts within papers that cite data. Analysis may also be performed on the citation network formed between datasets and papers that use them. Finally, curation actions can be associated with properties of studies and usage histories.

This dataset has several limitations of which users should be aware. First, Jira tickets can only be used to represent the intensiveness of curation for activities undertaken since 2017, when ICPSR started using both Curation Levels and Jira. Studies published before 2017 were all curated, but documentation of the extent of that curation was not standardized and therefore could not be included in these analyses. Second, the measure of publications relies upon the authors’ clarity of data citation and the ICPSR Bibliography staff’s ability to discover citations with varying formality and clarity. Thus, there is always a chance that some secondary-data-citing publications have been left out of the bibliography. Finally, there may be some cases in which a paper in the ICSPSR bibliography did not actually obtain data from ICPSR. For example, PIs have often written about or even distributed their data prior to their archival in ICSPR. Therefore, those publications would not have cited ICPSR but they are still collected in the Bibliography as being directly related to the data that were eventually deposited at ICPSR.

In summary, the MICA dataset contains relationships between two main types of entities – papers and studies – which can be mined. The tables in the MICA dataset have supported network analysis (community structure and clique detection) 30 ; natural language processing (NER for dataset reference detection) 32 ; visualizing citation networks (to search for datasets) 38 ; and regression analysis (on curation decisions and data downloads) 29 . The data are currently being used to develop research metrics and recommendation systems for research data. Given that DOIs are provided for ICPSR studies and articles in the ICPSR Bibliography, the MICA dataset can also be used with other bibliometric databases, including DataCite, Crossref, OpenAlex, and related indexes. Subscription-based services, such as Dimensions AI, are also compatible with the MICA dataset. In some cases, these services provide abstracts or full text for papers from which data citation contexts can be extracted for semantic content analysis.

Code availability

The code 27 used to produce the MICA project dataset is available on GitHub at https://github.com/ICPSR/mica-data-descriptor and through Zenodo with the identifier https://doi.org/10.5281/zenodo.8432666 . Data manipulation and pre-processing were performed in Python. Data curation for distribution was performed in SPSS.

He, L. & Han, Z. Do usage counts of scientific data make sense? An investigation of the Dryad repository. Library Hi Tech 35 , 332–342 (2017).

Article   Google Scholar  

Brickley, D., Burgess, M. & Noy, N. Google dataset search: Building a search engine for datasets in an open web ecosystem. In The World Wide Web Conference - WWW ‘19 , 1365–1375 (ACM Press, San Francisco, CA, USA, 2019).

Buneman, P., Dosso, D., Lissandrini, M. & Silvello, G. Data citation and the citation graph. Quantitative Science Studies 2 , 1399–1422 (2022).

Chao, T. C. Disciplinary reach: Investigating the impact of dataset reuse in the earth sciences. Proceedings of the American Society for Information Science and Technology 48 , 1–8 (2011).

Article   ADS   Google Scholar  

Parr, C. et al . A discussion of value metrics for data repositories in earth and environmental sciences. Data Science Journal 18 , 58 (2019).

Eschenfelder, K. R., Shankar, K. & Downey, G. The financial maintenance of social science data archives: Four case studies of long–term infrastructure work. J. Assoc. Inf. Sci. Technol. 73 , 1723–1740 (2022).

Palmer, C. L., Weber, N. M. & Cragin, M. H. The analytic potential of scientific data: Understanding re-use value. Proceedings of the American Society for Information Science and Technology 48 , 1–10 (2011).

Zimmerman, A. S. New knowledge from old data: The role of standards in the sharing and reuse of ecological data. Sci. Technol. Human Values 33 , 631–652 (2008).

Cragin, M. H., Palmer, C. L., Carlson, J. R. & Witt, M. Data sharing, small science and institutional repositories. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 368 , 4023–4038 (2010).

Article   ADS   CAS   Google Scholar  

Fear, K. M. Measuring and Anticipating the Impact of Data Reuse . Ph.D. thesis, University of Michigan (2013).

Borgman, C. L., Van de Sompel, H., Scharnhorst, A., van den Berg, H. & Treloar, A. Who uses the digital data archive? An exploratory study of DANS. Proceedings of the Association for Information Science and Technology 52 , 1–4 (2015).

Pasquetto, I. V., Borgman, C. L. & Wofford, M. F. Uses and reuses of scientific data: The data creators’ advantage. Harvard Data Science Review 1 (2019).

Gregory, K., Groth, P., Scharnhorst, A. & Wyatt, S. Lost or found? Discovering data needed for research. Harvard Data Science Review (2020).

York, J. Seeking equilibrium in data reuse: A study of knowledge satisficing . Ph.D. thesis, University of Michigan (2022).

Kilbride, W. & Norris, S. Collaborating to clarify the cost of curation. New Review of Information Networking 19 , 44–48 (2014).

Robinson-Garcia, N., Mongeon, P., Jeng, W. & Costas, R. DataCite as a novel bibliometric source: Coverage, strengths and limitations. Journal of Informetrics 11 , 841–854 (2017).

Qin, J., Hemsley, J. & Bratt, S. E. The structural shift and collaboration capacity in GenBank networks: A longitudinal study. Quantitative Science Studies 3 , 174–193 (2022).

Article   PubMed   PubMed Central   Google Scholar  

Acuna, D. E., Yi, Z., Liang, L. & Zhuang, H. Predicting the usage of scientific datasets based on article, author, institution, and journal bibliometrics. In Smits, M. (ed.) Information for a Better World: Shaping the Global Future. iConference 2022 ., 42–52 (Springer International Publishing, Cham, 2022).

Zeng, T., Wu, L., Bratt, S. & Acuna, D. E. Assigning credit to scientific datasets using article citation networks. Journal of Informetrics 14 , 101013 (2020).

Koesten, L., Vougiouklis, P., Simperl, E. & Groth, P. Dataset reuse: Toward translating principles to practice. Patterns 1 , 100136 (2020).

Du, C., Cohoon, J., Lopez, P. & Howison, J. Softcite dataset: A dataset of software mentions in biomedical and economic research publications. J. Assoc. Inf. Sci. Technol. 72 , 870–884 (2021).

Aryani, A. et al . A research graph dataset for connecting research data repositories using RD-Switchboard. Sci Data 5 , 180099 (2018).

Färber, M. & Lamprecht, D. The data set knowledge graph: Creating a linked open data source for data sets. Quantitative Science Studies 2 , 1324–1355 (2021).

Perry, A. & Netscher, S. Measuring the time spent on data curation. Journal of Documentation 78 , 282–304 (2022).

Trisovic, A. et al . Advancing computational reproducibility in the Dataverse data repository platform. In Proceedings of the 3rd International Workshop on Practical Reproducible Evaluation of Computer Systems , P-RECS ‘20, 15–20, https://doi.org/10.1145/3391800.3398173 (Association for Computing Machinery, New York, NY, USA, 2020).

Borgman, C. L., Scharnhorst, A. & Golshan, M. S. Digital data archives as knowledge infrastructures: Mediating data sharing and reuse. Journal of the Association for Information Science and Technology 70 , 888–904, https://doi.org/10.1002/asi.24172 (2019).

Lafia, S. et al . MICA Data Descriptor. Zenodo https://doi.org/10.5281/zenodo.8432666 (2023).

Lafia, S., Thomer, A., Bleckley, D., Akmon, D. & Hemphill, L. Leveraging machine learning to detect data curation activities. In 2021 IEEE 17th International Conference on eScience (eScience) , 149–158, https://doi.org/10.1109/eScience51609.2021.00025 (2021).

Hemphill, L., Pienta, A., Lafia, S., Akmon, D. & Bleckley, D. How do properties of data, their curation, and their funding relate to reuse? J. Assoc. Inf. Sci. Technol. 73 , 1432–44, https://doi.org/10.1002/asi.24646 (2021).

Lafia, S., Fan, L., Thomer, A. & Hemphill, L. Subdivisions and crossroads: Identifying hidden community structures in a data archive’s citation network. Quantitative Science Studies 3 , 694–714, https://doi.org/10.1162/qss_a_00209 (2022).

ICPSR. ICPSR Bibliography of Data-related Literature: Collection Criteria. https://www.icpsr.umich.edu/web/pages/ICPSR/citations/collection-criteria.html (2023).

Lafia, S., Fan, L. & Hemphill, L. A natural language processing pipeline for detecting informal data references in academic literature. Proc. Assoc. Inf. Sci. Technol. 59 , 169–178, https://doi.org/10.1002/pra2.614 (2022).

Hook, D. W., Porter, S. J. & Herzog, C. Dimensions: Building context for search and evaluation. Frontiers in Research Metrics and Analytics 3 , 23, https://doi.org/10.3389/frma.2018.00023 (2018).

https://www.icpsr.umich.edu/web/ICPSR/thesaurus (2002). ICPSR. ICPSR Thesaurus.

https://www.icpsr.umich.edu/files/datamanagement/icpsr-curation-levels.pdf (2020). ICPSR. ICPSR Curation Levels.

McKinney, W. Data Structures for Statistical Computing in Python. In van der Walt, S. & Millman, J. (eds.) Proceedings of the 9th Python in Science Conference , 56–61 (2010).

Wickham, H. et al . Welcome to the Tidyverse. Journal of Open Source Software 4 , 1686 (2019).

Fan, L., Lafia, S., Li, L., Yang, F. & Hemphill, L. DataChat: Prototyping a conversational agent for dataset search and visualization. Proc. Assoc. Inf. Sci. Technol. 60 , 586–591 (2023).

Download references

Acknowledgements

We thank the ICPSR Bibliography staff, the ICPSR Data Curation Unit, and the ICPSR Data Stewardship Committee for their support of this research. This material is based upon work supported by the National Science Foundation under grant 1930645. This project was made possible in part by the Institute of Museum and Library Services LG-37-19-0134-19.

Author information

Authors and affiliations.

Inter-university Consortium for Political and Social Research, University of Michigan, Ann Arbor, MI, 48104, USA

Libby Hemphill, Sara Lafia, David Bleckley & Elizabeth Moss

School of Information, University of Michigan, Ann Arbor, MI, 48104, USA

Libby Hemphill & Lizhou Fan

School of Information, University of Arizona, Tucson, AZ, 85721, USA

Andrea Thomer

You can also search for this author in PubMed   Google Scholar

Contributions

L.H. and A.T. conceptualized the study design, D.B., E.M., and S.L. prepared the data, S.L., L.F., and L.H. analyzed the data, and D.B. validated the data. All authors reviewed and edited the manuscript.

Corresponding author

Correspondence to Libby Hemphill .

Ethics declarations

Competing interests.

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Cite this article.

Hemphill, L., Thomer, A., Lafia, S. et al. A dataset for measuring the impact of research data and their curation. Sci Data 11 , 442 (2024). https://doi.org/10.1038/s41597-024-03303-2

Download citation

Received : 16 November 2023

Accepted : 24 April 2024

Published : 03 May 2024

DOI : https://doi.org/10.1038/s41597-024-03303-2

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

Quick links

  • Explore articles by subject
  • Guide to authors
  • Editorial policies

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

research data analysis michigan

Jump to navigation

Home

Michigan Mouse Metabolic Phenotyping Center in Live Models

  • Publications

Search form

Research asistance: customized data analysis, test number, service provided by, please help us continue our mission to advance the research for mouse models of diabetes, diabetic complications, obesity and related metabolic diseases and conditions by acknowledging our grant number u2cdk135066  in publications resulting from services provided by the mouse metabolic phenotyping center - live at michigan..

Jump to navigation

Search form

Home

The Graduate School

  • Faculty/Staff Resources
  • Programs of Study Browse the list of MSU Colleges, Departments, and Programs
  • Graduate Degree List Graduate degrees offered by Michigan State University
  • Research Integrity Guidelines that recognize the rights and responsibilities of researchers
  • Online Programs Find all relevant pre-application information for all of MSU’s online and hybrid degree and certificate programs
  • Graduate Specializations A subdivision of a major for specialized study which is indicated after the major on official transcripts
  • Graduate Certificates Non-degree-granting programs to expand student knowledge and understanding about a key topic
  • Interdisciplinary Graduate Study Curricular and co-curricular opportunities for advanced study that crosses disciplinary boundaries
  • Theses and Dissertations Doctoral and Plan A document submission process
  • Policies and Procedures important documents relating to graduate students, mentoring, research, and teaching
  • Academic Programs Catalog Listing of academic programs, policies and related information
  • Traveling Scholar Doctoral students pursue studies at other BTAA institutions
  • Apply Now Graduate Departments review applicants based on their criteria and recommends admission to the Office of Admissions
  • International Applicants Application information specific to international students
  • PhD Public Data Ph.D. Program Admissions, Enrollments, Completions, Time to Degree, and Placement Data
  • Costs of Graduate School Tools to estimate costs involved with graduate education
  • Recruitment Awards Opportunities for departments to utilize recruitment funding
  • Readmission When enrollment is interrupted for three or more consecutive terms
  • Assistantships More than 3,000 assistantships are available to qualified graduate students
  • Fellowships Financial support to pursue graduate studies
  • Research Support Find funding for your research
  • Travel Funding Find funding to travel and present your research
  • External Funding Find funding outside of MSU sources
  • Workshops/Events Find opportunities provided by The Graduate School and others
  • Research Opportunities and programs for Research at MSU
  • Career Development Programs to help you get the career you want
  • Graduate Educator Advancement and Teaching Resources, workshops, and development opportunities to advance your preparation in teaching
  • Cohort Fellowship Programs Spartans are stronger together!
  • The Edward A. Bouchet Graduate Honor Society (BGHS) A national network society for students who have traditionally been underrepresented
  • Summer Research Opportunities Program (SROP) A gateway to graduate education at Big Ten Academic Alliance universities
  • Alliances for Graduate Education and the Professoriate (AGEP) A community that supports retention, and graduation of underrepresented doctoral students
  • Recruitment and Outreach Ongoing outreach activities by The Graduate School
  • Diversity, Equity, and Inclusion Funding Funding resources to recruit diverse students
  • Graduate Student Organizations MSU has over 900 registered student organizations
  • Grad School Office of Well-Being Collaborates with graduate students in their pursuit of their advanced degree and a well-balanced life
  • Housing and Living in MI MSU has an on and off-campus housing site to help find the perfect place to stay
  • Mental Health Support MSU has several offices and systems to provide students with the mental health support that they need
  • Spouse and Family Resources MSU recognizes that students with families have responsibilities that present challenges unique to this population
  • Health Insurance Health insurance info for graduate student assistants and students in general at MSU
  • Safety and Security MSU is committed to cultivating a safe and inclusive campus community characterized by a culture of safety and respect
  • Why Mentoring Matters To Promote Inclusive Excellence in Graduate Education at MSU
  • Guidelines Guidelines and tools intended to foster faculty-graduate student relationships
  • Toolkit A set of resources for support units, faculty and graduate students
  • Workshops Workshops covering important topics related to mentor professional development
  • About the Graduate School We support graduate students in every program at MSU
  • Strategic Plan Our Vision, Values, Mission, and Goals
  • Social Media Connect with the Graduate School!
  • History Advancing Graduate Education at MSU for over 25 years
  • Staff Directory
  • Driving Directions

Content of a Statistical Analysis Plan

Graduate Students

Event Date:  Friday, May 24 from noon to 1 p.m.

Developing a statistical analysis plan (SAP) is a critical component of designing and conducting research studies which influences collection, analysis, and interpretation of data.

Learning objectives:

  • Understand that the SAP promotes rigor and reproducibility.
  • Understand the contents of an SAP (administrative information, background, study level and variable level information, analysis methods, amendments).
  • Learn about structured reporting and estimates.

This seminar repeats the topics from the workshop "Developing your own statistical analysis plan" and thus is intended for participants who had not taken the earlier workshop.

This seminar is eligible for RECR credit. Learn more and register here .

Michigan State University Wordmark

  • Call us: (517) 353-3220
  • Contact Information
  • Privacy Statement
  • Site Accessibility
  • Call MSU: (517) 355-1855
  • Visit: msu.edu
  • MSU is an affirmative-action, equal-opportunity employer.
  • Notice of Nondiscrimination
  • Spartans Will.
  • © Michigan State University

Child Care and Early Education Research Connections

Cost savings of early childhood education in michigan: the value of getting one child ready for kindergarten, summary.

This report uses the most recently available data and methods to update previous estimates of the cost savings of early childhood education in Michigan. This is a two-page summary of the full report. (author abstract)

- Related Resources

Related resources include summaries, versions, measures (instruments), or other resources in which the current document plays a part. Research products funded by the Office of Planning, Research, and Evaluation are related to their project records.

Cost savings of early childhood education in Michigan: The value of getting one child ready for kindergarten

- you may also like.

These resources share similarities with the current selection.

Cost savings analysis of school readiness in Michigan

Cost savings analysis of school readiness in michigan [executive summary].

  • Open Data Portal
  • Labor Market Information Labor Market Information collapsed link
  • Michigan's Labor Market News
  • Michigan Labor Market Data
  • Labor Market Information

Search is currently unavailable. Please try again later.

Popular on michigan.gov

  • Agriculture and Rural Development
  • Civil Rights
  • Environment
  • Health and Human Services
  • Natural Resources
  • Secretary of State

How Do I...

  • Register to Vote
  • Renew My License Plate
  • View assistance programs

The web Browser you are currently using is unsupported, and some features of this site may not work as intended. Please update to a modern browser such as Chrome, Firefox or Edge to experience all features Michigan.gov has to offer. 

  • Google Chrome
  • Microsoft Edge

Michigan Workforce Participation Increases Over the Year Across All Age Groups

May 13, 2024

Reviewing labor market trends among different demographic populations can highlight disparities among these groups. Demographic data for all states are published as an annual average by the U.S. Bureau of Labor Statistics once a year. This data comes from the Current Population Survey (CPS), which is a different source than the standard monthly labor market information containing Michigan’s official unemployment rate, labor force, and payroll jobs. Comparisons should not be made with monthly data, as the demographic information presented here is a 12-month average for Michigan and the U.S.

For this analysis, Michigan’s workforce data from the Current Population Survey was divided into three age cohorts: young adults, or those ages 16 to 24; the prime age adult workforce, ages 25 to 54; and older adults, workers age 55 or older. The 12-month moving average from March 2024 showed that young adults made up 14.8 percent of the total workforce, prime aged adults made up 61.6 percent of the workforce, and older adults age 55 and over accounted for approximately 23.5 percent of Michigan’s working population.

Unemployment Rates and Labor Force Participation Rates by Age Group, March 2023 vs. March 2024

(12-month Annual Averages)

Michigan unemployment rates and labor force participation rates by age group March 2023 March 2024

Source: Current Population Survey, U.S. Bureau of Labor Statistics

View/Download Data Table

Unemployment Rates for Young and Prime Aged Workers Decreased since March 2023

A comparison of Michigan 12-month moving average unemployment rate data demonstrated that unemployment rates decreased between March 2023 and March 2024 for the two cohorts comprising of those ages 16 to 54. Jobless rates declined by 0.7 percentage points for both young adults and those of prime working age. For those age 55 or older, the unemployment rate advanced by half a percentage point over the year. The national unemployment rate for young adults during March 2024 was comparable to the 12-month average rate seen in Michigan (8.6 percent versus 8.5 percent). Michigan’s adult workforce jobless rate, at 2.7 percent, was 0.7 percentage points lower than the nation’s adult rate. In contrast, the unemployment rate for those 55 and older was 0.4 percentage points higher in the state than the national rate of 2.7 percent.

An examination of Michigan unemployment rates by age comparing 12-month moving average data from March 2023 to March 2024 demonstrated that young adults experienced the largest monthly jobless rates out of all three cohorts, although the rate dipped from May to August of 2023 as young men and women found employment during the summer months. Jobless rates remained relatively stable for the adult workforce during 2023, then fell below 3.0 percent from December 2023 to March 2024. Joblessness for the older adult workforce expanded during the same time period, surpassing the adult cohort jobless rate from November 2023 onwards.

Unemployment rates have gone up slightly for older workers since March 2023.

Labor force participation rose for all three age groups over the year.

Workforce participation advanced in all three cohorts in Michigan between March 2023 and March 2024. For the prime-aged adult population, labor force participation rose by 3.7 percentage points over the year. Labor force participation among younger adults advanced by 3.3 percentage points during the same period. Participation among older adults advanced the least out of the three age groups, increasing by 0.8 percentage points since March 2023.

From March 2023 to March 2024, 12-month moving average workforce participation rates remained relatively stable for all three age groups. During March 2024, the statewide average workforce participation rate for young adults and adults ages 25 to 54 surpassed labor force participation rates for both cohorts on the national level. U.S. workforce participation for those age 55 or older was 1.2 percentage points above the 12-month moving average rate for older adults in the state (38.7 versus 37.5 percent).

All three age groups demonstrate labor force participation growth since March 2023.

Shibani Putatunda

Economic Specialist

Related News

ORIGINAL RESEARCH article

Urban green spaces and resident health: an empirical analysis from data across 30 provinces in china.

Yan Bi

  • Chengdu University of Technology, Chengdu, Sichuan Province, China

The final, formatted version of the article will be published soon.

Select one of your emails

You have multiple emails registered with Frontiers:

Notify me on publication

Please enter your email address:

If you already have an account, please login

You don't have a Frontiers account ? You can register here

Background: This study aims to explore the correlation between urban green space coverage and resident health, and to analyze its underlying mechanisms. Methods: Using panel data from 30 provinces in China from 2006 to 2022, which mainly includes urban green space coverage, general health of the population, air quality, and social connectivity. This research constructed a fixed effects model to perform baseline regression analysis. A series of robustness tests, including variable substitution, controlling for geographical differences, regional robustness tests, and shortening the time span of the study, further verified the robustness of the results. Additionally, mechanism tests were conducted to examine the positive impacts of urban green spaces on resident health by improving air quality and enhancing social connectivity. Results: The findings indicate a significant positive correlation between urban green space coverage and resident health levels. That is, the greater the area covered with urban green space, the healthier the residents of the area will be. Robustness tests support the reliability of this finding, while mechanism analysis reveals that urban green spaces have a positive impact on the health of the population by improving air quality and increasing social connectivity. Discussion: This study underscores the importance of urban green space planning in improving resident health and quality of life, providing urban planners with scientific evidence to optimize urban green systems for broader health objectives.

Keywords: Urban green spaces, Resident health, Air Quality, Social Connectivity, urban planning

Received: 29 Apr 2024; Accepted: 16 May 2024.

Copyright: © 2024 Bi, Wang, Yang, Mao and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Ya Wang, Chengdu University of Technology, Chengdu, 610059, Sichuan Province, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

The green IT revolution: A blueprint for CIOs to combat climate change

Companies and governments looking to combat climate change are turning to tech for help. AI, new technologies, and some promising tech-driven business models have raised hopes for dramatic progress.

About the authors

This article is a collaborative effort by Gerrit Becker, Luca Bennici, Anamika Bhargava, Andrea Del Miglio , Jeffrey Lewis , and Pankaj Sachdeva, representing views from McKinsey Technology.

While many organizations’ climate goals are lofty, enterprise technology leaders—CIOs, chief digital innovation officers (CDIOs), and chief technology officers (CTOs), among others—have not always succeeded at turning climate ambitions into reality. One of the biggest reasons is that hard facts and clear paths of action are scarce. Misconceptions and misinformation have clouded the picture of what CIOs and tech leaders should do.

We have done extensive analysis of where technology can have the biggest impact on reducing emissions. To start, we divided technology’s role into two primary types of activities:

  • offense—the use of technology and analytics to cut emissions by reducing (improving operational efficiency), replacing (shifting emission-generating activities to cleaner alternatives), and reusing (recycling material)
  • defense—the actions IT can take to reduce emissions from the enterprise’s technology estate

Scope of the McKinsey analysis

McKinsey’s emissions analysis for this report focuses on enterprise technology emissions, which are the business IT emissions from the hardware, software, IT services, enterprise communications equipment, mobile devices, fixed and mobile network services, and internal technology teams that a company uses for its own operations and that a CIO has control over. These include the emissions related to the full life cycles of the products and services that an enterprise IT function uses, including their development, delivery, usage, and end of life (exhibit). Our internal services emissions' analysis assumes around 40 percent of IT workers are working from home.

The analysis does not include the emissions from the technology products and services that a company is selling (such as data center capacity sold by hyperscalers), operational technology devices (such as sensors and point-of-sale systems), and cryptocurrency mining.

The defense activities are where the CIO, as the head of IT, can act independently and quickly. This article focuses on defense, specifically the IT elements over which a CIO has direct control. We examined emissions from use of electricity for owned enterprise IT operations, such as the running of on-premises data centers and devices (classified as scope 2 by the Greenhouse Gas Protocol 1 Greenhouse Gas Protocol: Technical Guidance for Calculating Scope 3 Emissions: Supplement to the Corporate Value Chain (Scope 3) Accounting & Reporting Standard , World Resources Institute & World Business Council for Sustainable Development, 2013. Scope 1 emissions are direct emissions from the activities of an organization or under their control, including fuel combustion on site such as gas boilers, fleet vehicles, and air-conditioning leaks; scope 2 emissions are from electricity purchased and used by the organization; and scope 3 emissions are all indirect emissions not included in scope 2 that occur in the value chain of the reporting company, including both upstream and downstream emissions. ), and indirect emissions from technology devices that the CIO buys and disposes of (scope 3). 2 These calculations do not include emissions from technology-driven services sold, such as cloud capacity. (See sidebar, “Scope of the McKinsey analysis.”)

What the facts say

Our analysis has uncovered several facts that contravene some commonly held views about enterprise technology emissions. These facts involve the significant amount of tech-related emissions, the share of emissions from end-user devices, the variety of mitigation options available, and the favorable impact of shifting to cloud computing.

Enterprise technology generates significant emissions

Enterprise technology is responsible for emitting about 350 to 400 megatons of carbon dioxide equivalent gases (CO 2 e), accounting for about 1 percent of total global greenhouse gas (GHG) emissions. At first blush, this might not seem like a lot, but it equals about half of the emissions from aviation or shipping and is the equivalent of the total carbon emitted by the United Kingdom.

The industry sector that contributes the largest share of technology-related scope 2 and scope 3 GHG emissions is communications, media, and services (Exhibit 1). Enterprise technology’s contribution to total emissions is especially high for insurance (45 percent of total scope 2 emissions) and for banking and investment services (36 percent).

This amount of carbon dioxide and equivalent gases is a significant prize for companies under increasing pressure to cut emissions. Progress on climate change requires action on many fronts, and enterprise technology offers an important option that CIOs and companies can act on quickly.

Taking a photo of bamboo forest - stock photo

You’re invited

To a McKinsey Technology webinar on the critical role of technology in building a sustainable enterprise on October 25, 9:30–10:30am ET.

The biggest carbon culprit is end-user devices, not on-premises data centers

End-user devices—laptops, tablets, smartphones, and printers—generate 1.5 to 2.0 times more carbon globally than data centers (Exhibit 2). 3 On-premises and co-located data centers used by enterprises, not including data center capacity sold by hyperscalers. One reason is that companies have significantly more end-user devices than servers in on-premises data centers. In addition, the devices typically are replaced much more often: smartphones have an average refresh cycle of two years, laptops four years, and printers five years. On average, servers are replaced every five years, though 19 percent of organizations wait longer. 4 Rhona Ascierto and Andy Lawrence, Uptime Institute global data center survey 2020 , Uptime Institute, July 2020.

More worrisome, emissions from end-user devices are on track to increase at a CAGR of 12.8 percent per year. 5 End-user computing market: Growth, trends, COVID-19 impact, and forecasts (2022–2027) , Mordor Intelligence, January 2022. Efforts to address this could target the major causes of emissions from these devices. About three-fourths of the emissions comes from manufacturing, upstream transportation, and disposal. A significant source of these emissions is the semiconductors that power the devices.

Plenty of low-cost/high-impact options exist, starting with improved sourcing

We have found that when it comes to going green, many CIOs think in terms of investments needed to replace items or upgrade facilities. Our analysis, however, finds that CIOs can capture significant carbon benefits without making a significant investment—and in some cases can even save money (Exhibit 3).

Overall, for example, 50 to 60 percent of emissions related to end-user devices can be addressed through sourcing changes, primarily by procuring fewer devices per person and extending the life cycle of each device through recycling. These options will not require any investment and will lower costs, though companies may want to evaluate the impact on employee experience.

In addition, companies can more aggressively recycle their devices; 89 percent of organizations recycle less than 10 percent of their hardware overall. 6 Sustainable IT: Why it’s time for a green revolution for your organization’s IT , Capgemini Research Institute, 2021. CIOs can put pressure on suppliers to use greener devices, especially as companies in the semiconductor sector are already increasing their commitments to emission reduction. Further low-cost, high-impact actions include optimizing business travel and data center computing needs, as well as increasing the use of cloud to manage workloads.

Moving to cloud has more impact than optimizing data centers

Optimizing an on-premises data center’s power usage effectiveness (PUE) 7 PUE describes how efficiently a computer data center uses energy, expressed as the ratio of total facility energy to IT equipment energy. is expensive and results in limited carbon abatement. If a company were to double what it spends on infrastructure and cloud to reduce PUE, it would cut carbon emissions by only 15 to 20 percent. Structural improvements in data centers and optimized layout can help, but the impact is limited, and many companies have already implemented them. More aggressive measures, such as moving data centers to cooler locations or investing in new cooling tech, are prohibitively expensive.

A more effective approach is to migrate workloads to the cloud. Hyperscalers (also known as cloud service providers) and co-locators are investing significantly to become greener through measures such as buying green energy themselves and investing in ultra-efficient data centers with a PUE equal to or less than 1.10, compared with the average PUE of 1.57 for an on-premises data center. 8 “Uptime Institute 11th annual Global Data Center Survey shows sustainability, outage, and efficiency challenges amid capacity growth,” Uptime Institute, September 14, 2021. (We estimate that companies could achieve just a 1.3 PUE score for their data center if they invested nearly 250 percent more, on average, over what they currently spend for their data centers and cloud presence.)

With thoughtful migration to and optimized usage of the cloud, companies could reduce the carbon emissions from their data centers by more than 55 percent—about 40 megatons of CO 2 e worldwide, the equivalent of the total carbon emissions from Switzerland.

Three steps to take now

With companies and governments under intensifying pressure to cut carbon emissions and with technology playing a key role in delivering on those goals, CIOs will find themselves on the front lines. The challenge will be to reduce IT’s carbon footprint while delivering high-quality, low-cost technology services to customers and employees.

On average, completion of the defensive steps might take three to four years. However, CIOs who act decisively and precisely can achieve 15 to 20 percent of carbon reduction potential in the first year with minimal investment.

CIOs can choose from among a wide array responses, particularly in conjunction with the CEO and the board. However, three measures they can take right now will prepare the organization for longer-term efforts. These measures involve sourcing strategies, key metrics, and a performance management system.

Map of the world designed in flowers

The net-zero transition: What it would cost, what it could bring

Move now on sourcing strategies.

Far and away the fastest and most effective defensive measure for reducing IT carbon emissions is to revise policies for technology sourcing. Optimizing the number of devices in line with standards followed by companies in the top quartile 9 Top quartile in terms of the ratio of devices to people is derived from the number of devices per person. Our analysis uses McKinsey Digital’s Ignite solutions and 2020 data. would reduce about 30 percent of end-user-device emissions, the amount of carbon emitted by Hong Kong. For example, top-quartile companies have one printer for every 16 people in the workplace; the overall average is one printer per eight people.

This sourcing shift does not necessarily lead to a degradation in user experience, because the rollout of 5G and increasingly advanced processing and compute power allow the main processing function to happen at the server. Therefore, devices can be less powerful and consume much less energy. Essentially, this is a software-as-a-service (SaaS) model where high-end and user-friendly experiences happen on the server, not the device. The effectiveness of this approach will depend on having stable networks, less resource-intensive coding at the device level, edge computing capabilities, and shifts of offerings to more efficient platforms (for example, cloud).

As part of this effort, the CIO and the business’s head of procurement will need to collaborate on reviewing and adjusting device refresh timelines and device-to-person ratios, as well as adjusting the basis for purchasing decisions. Procurement generally relies on cost/benefit calculations, and rightly so. That approach will need to expand to account for carbon dioxide emissions. The spirit of collaboration should extend to suppliers as well, with the parties working together to formulate plans that provide the greatest benefits for all.

A more thoughtful sourcing strategy extends beyond end-user devices. CIOs, for example, should look for green sources of the electricity IT uses. When these sources are unavailable, CIOs can direct procurement to power purchase agreements to offset carbon use. CIOs can also set green standards for their vendors and suppliers, requiring GHG emissions disclosures and incorporating them into their criteria for purchase decisions.

Establish a green ROI metric for technology costs

Any real progress on green technology can happen only when companies measure their “green returns.” But today, most green metrics omit cost and savings, which ultimately makes them impractical. A better metric focuses on cost per ton of carbon saved (accounting for costs saved as well). Sophisticated models calculate emissions throughout the full life cycle, including production, transportation, and disposal.

CIOs can further assess suppliers, manufacturers, and service providers based on how advanced they are in recycling and refurbishing electronics; designing circular components; extending product life cycles with better design, higher-quality manufacturing, and more robust materials; offering repair services; and reselling to consumers.

Decisions about IT spending need to consider a range of factors, including technical debt abatement and business strategy. Along with these factors, companies should institutionalize a green ROI metric that is transparent to everybody in the business as an element in IT decision making, including in requests for proposals (RFPs). Doing so will enable companies to better understand the true impact their technology is having on carbon emissions.

Put in place green measurement systems

Establishing a green ROI metric is only a start. CIOs need to establish a baseline of performance, measure progress against the baseline, and track impact in near real time, much as companies track real-time computer and network usage for applications in the cloud. This kind of measuring system ensures that CIOs know what’s working and what isn’t, so they can adjust quickly.

In practice, implementing green measurement can be challenging. Some companies have spent a year measuring their carbon footprint, ending up with an outdated analysis. This tends to happen when companies are determined to measure every bit of carbon emitted, a praiseworthy but time-consuming effort. CIOs can make substantial progress by instead prioritizing measurement where the impact is highest, such as tracking the number of end-user devices purchased and in use, the current duration of use for each device, and the ratio of devices per user. Another way CIOs can make quick progress is to embed emissions- and power-monitoring capabilities into large technology assets and work with external providers, such as electricity companies, to track usage in real time.

Effectively combating climate change won’t happen through one or two big wins; those don’t exist yet. To have real impact, companies and governments will need to act in many areas. Technology has a huge role to play in many of these areas, but CIOs and tech leaders need to act quickly and decisively.

This article is the first in a series about how CIOs can reduce emissions. The next article will explore how CIOs can drive the business’s sustainability agenda by playing offense and implementing reduce, replace, and reuse levers to decarbonize.

Gerrit Becker is an associate partner in McKinsey’s Frankfurt office, Luca Bennici is an associate partner in the Dubai office, Anamika Bhargava is a consultant in the Toronto office, Andrea Del Miglio is a senior partner in the Milan office, Jeffrey Lewis is a senior partner in the New Jersey office, and Pankaj Sachdeva is a partner in the Philadelphia office.

The authors wish to thank Bernardo Betley, Arjita Bhan, Raghuvar Choppakatla, Sebastian Hoffmann, Abdelrahman Mahfouz, Tom Pütz, Jürgen Sailer, Tim Vroman, Alice Yu, and Gisella Zapata for their contributions to this article.

Explore a career with us

Related articles.

Worker walking in the factory

It’s not easy buying green: How to win at sustainable sourcing

" "

Sustainability in semiconductor operations: Toward net-zero production

Futuristic organic sphere

Delivering the climate technologies needed for net zero

IMAGES

  1. Standard statistical tools in research and data analysis

    research data analysis michigan

  2. Research Data

    research data analysis michigan

  3. 5 Steps of the Data Analysis Process

    research data analysis michigan

  4. Unleashing Insights: Mastering the Art of Research and Data Analysis

    research data analysis michigan

  5. Quantitative research tools for data analysis

    research data analysis michigan

  6. Tools for data analysis in research methodology

    research data analysis michigan

VIDEO

  1. Analysis of Data? Some Examples to Explore

  2. Welcome to the World of Generative Artificial Intelligence

  3. Data analytics drill-down

  4. Applied Survey Research Data Analysis Correlations

  5. Qualitative Research Data Analysis University of Indianapolis OTD 551 Research Analysis course

  6. MS in Data Analytics and Program Evaluation: Building Real-World Skills for Career Growth

COMMENTS

  1. Michigan Center for Data and Analytics

    Through data, reporting, and analysis, the MCDA provides partners with the information and insights they need to make evidence-based decisions. Insights. Insights. Read the latest research from the Michigan Center for Data and Analytics. Read Insights. Michigan Open Data Portal. View and analyze data provided by a variety of Michigan state ...

  2. Data Science, Analytics, and Visualization

    Analysis, management, and display of rich data sources, and computational social science ... Research; Research areas; Data Science, Analytics, and Visualization Data Science, Analytics, and Visualization ... School of Information University of Michigan 105 S State St. Ann Arbor, ...

  3. Research Analysis & Data Integration Office

    Welcome to the Research Analysis & Data Integration Office (RADIO) at the University of Michigan. Harnessing the power of data to inform and elevate research and creative practice pursuits across campus, RADIO is U-M's central hub supporting data-driven decision-making about the research enterprise. From sophisticated integration of multiple ...

  4. MIDAS

    June 17, 2024 - June 21, 2024. Introduction to Data Science and AI High School Summer Camp 2024. View More. The Michigan Institute for Data Science strengthens the University of Michigan's preeminence in Data Science and Artificial Intelligence and enables their transformative use in a wide range of research disciplines to achieve lasting ...

  5. Master of Science in Survey and Data Science

    "Michigan's survey and data science graduate program has an extraordinary place in my heart and memories. The graduate program helped me to expand my knowledge in quantitative methods and statistics and gain a more holistic perspective about survey research."-Iago Santos Muraro, MS in Survey and Data Science, 2018

  6. ICPSR Data Excellence Research Impact

    ICPSR is research science data and resources on topics like social media, politics, economics, social sciences, government, GIS, & more.

  7. Major in Data Science

    The Data Science major in LSA consists of a total of 42 required credit hours, not including pre-requisites or pre-major courses.All courses must be completed with a minimum grade of C. Note that the EECS department limits students to two attempts for EECS 203, EECS 280, and EECS 281. Data Science Program Guide (Effective for declarations Fall 23' or prior)

  8. Data Analytics

    Data science research uses principles from computation, machine learning, statistics, and mathematics, to develop methods to analyze data and gain insight and knowledge about underlying systems to improve decision making. Data analytics is highly interdisciplinary with significant overlap with many areas, including human system integration, optimization, and stochastic systems. This area ...

  9. Data Research Analyst

    This dynamic role serves as a bridge between data and analysis teams and combines knowledge of research methods and data analysis skills. ... Michigan Program in Survey and Data Science Institute for Social Research 426 Thompson Street, Room 4050 Ann Arbor, MI 48104-1248 ...

  10. About CSCAR

    About CSCAR. Consulting for Statistics, Computing and Analytics Research (CSCAR) provides individualized support and training to University of Michigan researchers in a variety of areas relating to the management, collection, and analysis of data. CSCAR also supports the use of technical software and advanced computing in research.

  11. Survey Data Collection and Analytics

    In six courses, you will learn the basics of questionnaire design, data collection methods, sampling design, dealing with missing values, making estimates, combining data from different sources, and the analysis of survey data. Faculty for this specialisation comes from the Michigan Program in Survey Methodology and the Joint Program in Survey ...

  12. Data Science for Health Research

    In Data Science for Health Research, learn to organize and visualize health data using statistical analysis in programs like R. Explore how to translate data, interpret statistical models, and predict outcomes to help make data-informed decisions within the public health field. Subject. Data Science Language. English. Duration. 10 weeks. Status

  13. Research Data Analytics

    Research Portfolio Online Reporting Tools (RePORT) provide access to reports, data, and analyses of NIH research activities, including information on NIH expenditures - even categorical spending by disease or condition and the results of NIH supported research. ... Grant Services & Analysis is a unit of the Medical School Office of Research ...

  14. Undergraduate Research

    An honors thesis provides an opportunity for eligible students to carry out faculty-supervised research in the senior year. The application process and requirements for the Statistics, Data Science, and Informatics honors programs are described on the department website. Students are encouraged to contribute their thesis to the archive of honors theses at the University of Michigan Library.

  15. Data Science, Biostatistics, and Informatics Core

    The Data Science, Biostatistics, and Informatics Core is supported by the Statistics Online Computational Resources (SOCR). The Core provides expertise in experimental design, data management, information and data science, health analytics, mathematical modeling, and statistical computing for preclinical, clinical, and translational research studies conducted by the University of Michigan ...

  16. Clinical Research Design and Statistical Analysis

    The Clinical Research Design and Statistical Analysis program (OJOC CRDSA) is now accepting applications for the cohort starting in Fall 2022. ... Hands-on data analysis and presentation using standard computer software for linear and nonlinear analysis will be emphasized. Course goals include the ability to formulate and evaluate a model, to ...

  17. Bioinformatics Data Analysis

    We provide analysis of high-throughput sequencing data, including RNA-Seq, ChIP-Seq, and Exome-Seq, as well as other high throughput data. ... School Office of Research, where our mission is to foster an environment of innovation and efficiency that serves the Michigan Medicine research community and supports biomedical science from insight to ...

  18. Find Data

    Topics / Series / Thematic data collections / Data-related publications. Studies for which online analysis is available. Self-published data, including replication datasets. Studies with learning guides. New/Updated Data Releases: In the last week. In the last month. In the last quarter. In the last year.

  19. Data Tools and Analysis Archives

    Inter-University Consortium for Political and Social Research Virtual Data Enclave (ICPSR) maintains a data archive of more than 250,000 files of research in the social and behavioral sciences. Stuart Hutchings. 734-615-7362. [email protected].

  20. My Learning Analytics Research & Data

    This means that researchers will access several types of data in doing this analysis. Data. Canvas course; Student use of MyLA (clickstream data) Student demographic information; Access to this data falls under U-M's data governance and has been reviewed and approved by data stewards. An IRB exemption (HUM00036927) has been granted.

  21. Crash and Driving Data Analysis

    (2017) Societal Costs of Traffic Crashes and Crime in Michigan: 2017 Update (UMTRI-2017-01). Ann Arbor, MI: University of Michigan Transportation Research Institute. Newnam S, Blower D, Molnar LJ, Eby DW, and Koppel S. (2018) Exploring crash characteristics and injury outcomes: an analysis of truck-involved crash data in the US.

  22. A dataset for measuring the impact of research data and their ...

    This paper introduces a dataset developed to measure the impact of archival and data curation decisions on data reuse. The dataset describes 10,605 social science research datasets, their curation ...

  23. Research Asistance: Customized Data Analysis

    Building 25, Office 3682, North Campus Research Complex, 2800 Plymouth Road, Ann Arbor, MI 48109-2800 | Phone: (734) 647-2801 | Contact: Carol Elias

  24. Content of a Statistical Analysis Plan

    ATTENTION:Graduate StudentsEvent Date: Friday, May 24 from noon to 1 p.m.Developing a statistical analysis plan (SAP) is a critical component of designing and conducting research studies which influences collection, analysis, and interpretation of data.Learning objectives:Understand that the SAP promotes rigor and reproducibility.Understand the contents of an SAP (administrative information ...

  25. Cost savings of early childhood education in Michigan: The value of

    This report uses the most recently available data and methods to update previous estimates of the cost savings of early childhood education in Michigan. This is a two-page summary of the full report. (author abstract) ... Assessing Research Quality; Study Design and Analysis; Experiments and Quasi-Experiments; Descriptive Statistics;

  26. demographic-analysis-mar-2024

    For this analysis, Michigan's workforce data from the Current Population Survey was divided into three age cohorts: young adults, or those ages 16 to 24; the prime age adult workforce, ages 25 to 54; and older adults, workers age 55 or older. The 12-month moving average from March 2024 showed that young adults made up 14.8 percent of the ...

  27. AI strategy in business: A guide for executives

    Yuval Atsmon: When people talk about artificial intelligence, they include everything to do with analytics, automation, and data analysis. Marvin Minsky, the pioneer of artificial intelligence research in the 1960s, talked about AI as a "suitcase word"—a term into which you can stuff whatever you want—and that still seems to be the case.

  28. Urban Green Spaces and Resident Health: An Empirical Analysis from Data

    Methods: Using panel data from 30 provinces in China from 2006 to 2022, which mainly includes urban green space coverage, general health of the population, air quality, and social connectivity. This research constructed a fixed effects model to perform baseline regression analysis.

  29. The green IT revolution: A blueprint for CIOs

    Hyperscalers (also known as cloud service providers) and co-locators are investing significantly to become greener through measures such as buying green energy themselves and investing in ultra-efficient data centers with a PUE equal to or less than 1.10, compared with the average PUE of 1.57 for an on-premises data center. 8 "Uptime ...