Resources, Wk. 3-6 (and beyond)

Resources for Weeks 3-6 (and beyond)

Tutorials

Processing different file types

Regex

Topic modelling

Guidance with methods

Topic modelling options

Topic modelling with Mallet

Using LLMs

Text analysis: workflow and programming tips

Digital Publications

Youtube tutorials

Large Language Models

Creating public-facing websites

  • Livemark: “Data presentation framework for Python that generates static sites from extended Markdown with interactive charts, tables, scripts, and other features.”
  • Jupyter Book: More for long-form publications, but could be of interest. Melanie Walsh’s textbook was built using Jupyter Book. Now there is also version 2.

Extra readings

These readings didn’t quite fit into our 2-week instruction schedule but could still be of interest!

Books and book chapters

Articles

  • Kusumegi, Keigo, and Yukie Sano. “Dataset of Identified Scholars Mentioned in Acknowledgement Statements.” Scientific Data, vol. 9, no. 1, Aug. 2022, p. 461. www-nature-com.proxy.library.cornell.edu, https://doi.org/10.1038/s41597-022-01585-y.

  • Yin, Yian, et al. “Coevolution of Policy and Science during the Pandemic.” Science, vol. 371, no. 6525, Jan. 2021, pp. 128–30. DOI.org (Crossref), https://doi.org/10.1126/science.abe3084.