Important Dates

  • 04/04/2024: Journal article selected.
  • 04/25/2024: Progress report due.
  • 05/07/2024 (Section 001) or 05/08/2024 (Section 002):
    • Final project presentations.
    • Written report due.
    • Code due.
    • Peer assessment due.
    • All of this is due before the final project presentations (not on the following Thursday).

Grade Breakdown

  • 2 pts: Article/data Selection
  • 10 pts: Progress Report
  • 5 pts: Write-up
  • 5 pts: Presentation
  • 5 pts: Code
  • 2 pts: Peer assessment

General Description

  • For the final project you will work in teams of 2–4 to select a journal article from 2023 or later and reproduce their results.

Dataset (2 pts)

Criteria for an appropriate journal article are:

  1. Data are publicly available
  2. The authors’ code (R or otherwise) is not available.
  3. Authors used regression for analysis.
  4. You have not used these data before, and are not using them for anything else (including other classes) this semester.
  5. The data are from 2023 or later.
  6. No other group is using that data.
  7. It is a relatively complicated dataset. Ways that a dataset can be “relatively complicated” are:
    1. There are at \(\gtrapprox\) 10 predictor variables, or
    2. The authors use a method not taught in this class (e.g. principal components regression, lasso, etc).

I will let you propose a journal article with a dataset, but I retain the right to decide if the dataset is “relatively complicated”.

PLoS Journals are all required to have a data availability statement, so this is a good place to start searching.

You can get a sense of the type of articles I am looking for (or use one of these) based on the following that I picked out of a search from PLoS One. These aren’t necessarily good or interesting articles, just ones that satisfy the above requirements and were released when I checked for the newest releases in PLoS One. I don’t guarantee that these articles are fully reproducible. They just looked like the data were available and the methods were regression.

If the data are in a weird format (like “sav” or “dta”), then try to use the {haven} package to load it into R.

(Links at bottom of page in the References section)

  • Fu (2024)
  • Zhao (2024)
  • Frikha (2024)
  • Khanthik (2024)
  • Shinozuka (2024)
  • Wei (2024)
  • De Ridder (2024)
  • Vejřík (2024)
  • Villiers (2024)
  • Lares-Villaseñor (2024)
  • Li (2024)
  • Yasar (2024)
  • Wu (2024)
  • Zhang (2024)
  • Geng (2024)
  • Masters (2024)
  • Marsh (2024)
  • Wullems (2024)
  • Nairz (2024)

If you want to use one of the articles above, please provide me with rankings of your top five so I can apply Rank-maximal allocation to assign journal articles.

If you want to use one that is not on this list, please send me an email before the deadline so I can pre-approve it.

Progress Report (10 pts)

  • It often (not always) takes about two weeks to reproduce the results of a paper.

  • Sometimes, you find out right away that reproducing the paper is impossible, and you need to switch papers. This takes up time.

  • For the progress report you will turn in an R markdown file that loads in the data (that is already cleaned by some other script), fits a regression and reproduces the tables and figures in the paper you have selected.

  • If your numbers and figures are not the same as in the paper, your options are:

    1. Switch papers.
    2. Convince me that both (i) the results are impossible to reproduce with the given data and (ii) you can still get something interesting out of the data that you do have. If I’m not convinced of both of these, you’ll have to switch papers.
  • From past experience, students don’t believe me when I say it often (though, not always) takes two weeks to reproduce the results of a paper, so they start the day before the progress report is due and ask for extensions.

  • There will be no extensions on the progress report, even if you need to switch papers. Instead, there will be an upper bound on the grade of your progress report grade based on how late you turn it in.

    • < Two days late: 9 out of 10 pts max
    • < Four days late: 8 out of 10 pts max
    • < Six days late: 7 out of 10 pts max
    • < Eight days late: 6 out of 10 pts max
    • < Ten days late: 5 out of 10 pts max
    • More than ten days late: 0 out of 10 pts

Presentation/Written Report

You will provide a presentation and a written report. They should both follow this outline:

  • Introduce the problem/paper.
    • Very short. This is the least interesting part.
  • Describe the data.
  • State the model(s)/method(s)
  • Check assumptions.
  • Provide interpretation of results.
  • Critiques / expansion of analyses.
    • This is what I am most interested in. Please spend the most time on this.
    • It is OK if you think the authors did a great job and you have no improvements. You just have to justify this.
  • Summary/Conclusion
    • Very short. This is the least interesting part.

Presentation (5 pts)

  • Your presentation should be 12 minutes long. I will strict on time.

  • All group members should speak.

  • I will ask questions for 3 minutes at the end of the presentation.

Final Written Report (5 pts)

  • This should be an executive summary with lots of bullet points. Don’t make me read text not immediate relevant to the project. No yapping.
    • Summarize paper’s goals.
      • Very short. This is the least interesting part.
    • Summarize data.
    • Summarize model.
    • Check assumptions.
    • Interpret results.
    • Provide critiques
      • This is what I am most interested in. Please spend the most time on this.
    • Summarize your findings.
      • Very short. This is the least interesting part.
  • Limit your write-up to 4 pages, double spaced, 12 point font. This does not include figures/tables/references/code.

Code (5 pts)

  • You should turn in all of your code to reproduce the results of the paper in a single zipped folder.

  • I should be able to run your code without modification and obtain your results.

  • There should be no errors in your code.

  • You should use the methods/packages we’ve learned in class.

Peer Assessment (2 pts)

At the end of the project, I will have your teammates rate your contribution to the project. I will adjust your grade according to their comments. If they all say that you didn’t do anything, I will give you a failing grade.

You will get 2 points for filling out the assessment.

References

De Ridder, Steffen AND Scholtens, Adri AND Eriksen. 2024. “The Art of Valuation: Using Visual Analysis to Price Classical Paintings by Swedish Masters.” PLOS ONE 19 (1): 1–19. https://doi.org/10.1371/journal.pone.0296906.
Frikha, Nourhen AND Chaâri, Mohamed AND Mezghanni. 2024. “Towards Improving Online Learning in Physical Education: Gender Differences and Determinants of Motivation, Psychological Needs Satisfaction, and Academic Achievement in Saudi Students.” PLOS ONE 19 (2): 1–17. https://doi.org/10.1371/journal.pone.0297822.
Fu, Ghulam Rasool, Zhiyuan AND Madni. 2024. “Unveiling the Affecting Mechanism of Digital Transformation on Total Factor Productivity of Chinese Firms.” PLOS ONE 19 (2): 1–23. https://doi.org/10.1371/journal.pone.0298428.
Geng, Ningxue AND Ling, Puyang AND Fan. 2024. “The Perception of Mandarin Speech Conveying Communicative Functions in Chinese Heroin Addicts.” PLOS ONE 19 (2): 1–16. https://doi.org/10.1371/journal.pone.0299331.
Khanthik, Ngamjit AND Yaisawang, Apisit AND Kasetsuwan. 2024. “Factors Predicting the Visual Outcome of Intracorneal Ring Segment for Keratoconus.” PLOS ONE 19 (2): 1–13. https://doi.org/10.1371/journal.pone.0288181.
Lares-Villaseñor, Martha AND Salazar-García, Eunice AND Guevara-Cruz. 2024. “Genetic Risk Score for Insulin Resistance Based on Gene Variants Associated to Amino Acid Metabolism in Young Adults.” PLOS ONE 19 (2): 1–16. https://doi.org/10.1371/journal.pone.0299543.
Li, Dongli AND Liu, Yibo AND Yang. 2024. “The Impact of Environmental Education at Chinese Universities on College Students’ Environmental Attitudes.” PLOS ONE 19 (2): 1–16. https://doi.org/10.1371/journal.pone.0299231.
Marsh, Elvira AND Spence, Elizabeth AND Perez Vallejos. 2024. “Mindfully and Confidently Digital: A Mixed Methods Study on Personal Resources to Mitigate the Dark Side of Digital Working.” PLOS ONE 19 (2): 1–24. https://doi.org/10.1371/journal.pone.0295631.
Masters, Jessica L. AND Guillot, Allison K. AND Ward. 2024. “Comprehensive Characterization of the Effect of Mineralocorticoid Receptor Antagonism with Spironolactone on the Renin-Angiotensin-Aldosterone System in Healthy Dogs.” PLOS ONE 19 (2): 1–16. https://doi.org/10.1371/journal.pone.0298030.
Nairz, Alex AND Kiechl, Johannes AND Messner. 2024. “Determinants of Non-Alcoholic Fatty Liver Disease in Young People: Maternal, Neonatal, and Adolescent Factors.” PLOS ONE 19 (2): 1–15. https://doi.org/10.1371/journal.pone.0298800.
Shinozuka, Mizuho AND Hirayama, Masato AND Arai. 2024. “Peripapillary Vessel Density in Eyes with Cone-Rod Dystrophy.” PLOS ONE 19 (1): 1–8. https://doi.org/10.1371/journal.pone.0296167.
Vejřík, Ivana AND Sajdlová, Lukáš AND Vejříková. 2024. “A Non-Lethal Stable Isotope Analysis of Valued Freshwater Predatory Fish Using Blood and Fin Tissues as Alternatives to Muscle Tissue.” PLOS ONE 19 (1): 1–18. https://doi.org/10.1371/journal.pone.0297070.
Villiers, Claudia AND Thomas de, Olivia AND Elliot-Wilson. 2024. “Sleep and Cognition in South African Patients with Non-Functioning Pituitary Adenomas.” PLOS ONE 19 (1): 1–19. https://doi.org/10.1371/journal.pone.0296387.
Wei, Miao AND Zhang, Peng AND Ge. 2024. “Influence of Geographical Environment on Retinol-Binding Protein Reference Values in Chinese Men.” PLOS ONE 19 (1): 1–15. https://doi.org/10.1371/journal.pone.0297204.
Wu, Shengnan. 2024. “Application of Multimedia Technology to Innovative Vocational Education on Learning Satisfaction in China.” PLOS ONE 19 (2): 1–20. https://doi.org/10.1371/journal.pone.0298861.
Wullems, Hans AND Verschueren, Jorgen A. AND Degens. 2024. “Sedentary Behaviour (Especially Accumulation Pattern) Has an Independent Negative Impact on Skeletal Muscle Size and Architecture in Community-Dwelling Older Adults.” PLOS ONE 19 (2): 1–21. https://doi.org/10.1371/journal.pone.0294555.
Yasar, Pınar AND Sen, Burak AND Ozbilgehan. 2024. “Influence of Systemic Inflammatory Indices on Hospital Stay and Dialysis Post-Earthquake: A Clinical Study.” PLOS ONE 19 (2): 1–12. https://doi.org/10.1371/journal.pone.0299737.
Zhang, Xibin, Baoshan AND Dong. 2024. “The Unique Association Between Serum 25-Hydroxyvitamin d Concentrations and Blood Lipid Profiles in Agriculture, Forestry, and Fishing Occupations: Insights from NHANES 2001–2014.” PLOS ONE 19 (2): 1–19. https://doi.org/10.1371/journal.pone.0297873.
Zhao, Jingjing AND for the Alzheimer’s Disease Neuroimaging Initiative, Guanan AND Guan. 2024. “Sex-Dependent Association of Serum Uric Acid Levels with Amyloid Accumulation Among Amyloid-Positive Older Adults.” PLOS ONE 19 (2): 1–8. https://doi.org/10.1371/journal.pone.0296738.