Zachary Levonian's Blog — Index

View all my peer-reviewed research on my homepage. The paper that best represents who I am and what I value is “Peer Recommendation Interventions for Health-related Social Support”, which I published in the Computer Supported Cooperative Work conference in 2025.

This page lists my self-published writings, including 81 posts hosted on this blog, 4 hosted on Medium, and a few hosted elsewhere.

Posts

Feb 4, 2025
Overfitting to train/test splits
Ben Recht writes in a 2023 blog post that train-test splits are shockingly effective for evaluating machine learning models.
Sep 30, 2024
Two approaches to statistical modeling?
Three papers that break down the goals of statistical modeling.
Jun 26, 2024
Destroy 'AI', but keep designing with ML
Ali Alkhatib argues in a blog post that we should destroy AI. I see a few glimmers of hope.
May 24, 2024
Generating math hints with large language models
Large language models are pretty good at generating hints for students, given the right constraints.
Apr 16, 2024
Monitoring student safety during generative chats
Ensuring student safety with monitoring dashboards for students' LLM chats.
Feb 13, 2024
"What are the odds?" Stardew Valley edition
What's the probability of having exactly 111,111g in Stardew Valley?
Feb 2, 2024
Using retrieval-augmented generation to improve math question-answering
Humans prefer large language model responses to students' conceptual math questions when created with retrieval-augmented generation and "just the right amount" of prompting guidance.
Nov 26, 2023
Graphing Shakespeare and dramatizing data
Coupette et al.'s paper "All the world’s a (hyper)graph"
Nov 1, 2023
Designing LLM tutors to check for student understanding
What can Doug Lemov teach us about designing LLM tutors?
Oct 17, 2023
“Wishful Mnemonics” in Machine Learning Research
An always-relevant term from researcher Melanie Mitchell.
Aug 21, 2023
AI2 Dolma Most Frequent Words: a derivative dataset of the most common words
A dataset of word counts in Dolma.
Oct 29, 2021
Basics of machine learning with text data: bag-of-words for linear regression
Twitter data workshop presented to undergraduate HCI researchers.
Mar 15, 2019
NLP for Social Computing Workshop
Workshop presented to graduate HCI researchers.

Posts on Medium

May 9, 2022
How would you deal with an ambiguous problem?
Conceptual workflow for scoping Data Science problems.
Mar 9, 2022
Model vs Modeler: Trade-offs designing interactive text classification interfaces
Accessible summary of my IUI 2022 paper.
Nov 1, 2020
Integer Linear Programming with PuLP: Optimizing a DraftKings NFL lineup
Basic introduction to ILP with Python.
Nov 6, 2018
Prototyping a handheld with the Omega2: A complete beginner's guide
Tutorial for making a battery-powered handheld.

Short posts

Jul 10, 2025
I made it to 1000 ELO on Chess.com
And all it took was never practicing and playing a lot of games over lunch.
May 21, 2025
How does Wikipedia article quality impact decision-making?
A randomized experiment on Irish judges and a Wiki Workshop 2025 paper on the NFL draft.
Apr 23, 2025
Using Docker with pytest for integration testing
A few packages using Docker with Python's pytest for integration and functional testing.
Apr 4, 2025
Research paper: "Inference-Time Scaling for Generalist Reward Modeling"
DeepSeek paper introducing Self-Principled Critique Tuning for Generative Reward Modeling
Mar 27, 2025
Energy use of machine learning
Link round-up about the energy use of machine learning and large language models.
Mar 20, 2025
Thing precedes theory: motivating the design of new systems
Research through design, unmet needs, and other writing about system-building in human–computer interaction.
Mar 7, 2025
How I write peer reviews
My own process for writing peer reviews.
Feb 27, 2025
Learning webs: Ivan Illich & education technology
Ivan Illich is a controversial writer, but only if you consider destroying all schools controversial.
Feb 26, 2025
Introduction to Human–computer Interaction (HCI)
Resources for gaining familiarity with HCI.
Feb 25, 2025
Research paper: "Peer Recommendation Interventions for Health-related Social Support"
Can recommendation systems help people find health-related peer support online? I wrote a research paper exploring this question.
Feb 1, 2025
Non-linearities in floating point arithmetic
IEEE 754 floating point arithmetic is non-associative.
Jan 31, 2025
Reinforcement Learning Resources
A few resources on reinforcement learning.
Jan 30, 2025
Large language models as actors: Colin Fraser on alignment research
Colin Fraser wrote three Bluesky threads about problems in LLM alignment research.
Jan 27, 2025
Data sharing ethics at Crisis Text Line
Crisis Text Line is a non-profit.
Jan 17, 2025
Stochtree: a library for Stochastic tree ensembles (BART / XBART) for supervised learning and causal inference
Stochtree is a Bayesian Additive Regression Trees (BART) software library.
Jan 16, 2025
Downloading TikTok videos is easy with yt-dlp
TikTok's data export tool combined with yt-dlp makes it easy to download TikTok videos and metadata.
Nov 13, 2024
Prompting large language models
Resources for prompt engineering with large language models.
Nov 12, 2024
AI Safety: is there an existential risk?
A few notes on the existential risk posed by artifical intelligence.
Oct 15, 2024
Which academic journals should have articles on Wikipedia?
An interesting and long-running debate on English Wikipedia inclusion criteria.
Oct 7, 2024
Wrapping Python modules with Wrapt
The Python package Langfuse uses Wrapt to intercept calls to module functions.
Oct 4, 2024
Log-normal distributions in human behavioral data
Log-normal distrubitons appear a lot when measuring the time between human activities.
Sep 30, 2024
Asynchronous code in Python
A collection of links for learning about asynchronous development in Python.
Sep 29, 2024
Resources for building databases and data processing systems
A collection of links for learning about databases.
Sep 26, 2024
DVC: A tool for versioning data
DVC (Data Version Control) is a nifty tool.
Sep 11, 2024
Tutor Co-pilot: real-time scaffolding for math tutors
A project at PLUS using large language models for real-time support.
Jul 27, 2024
Research paper: Ideological differences in the expanse of the moral circle
2019 paper on political ideology and moral circles.
Jul 19, 2024
Research paper: When the implication is not to design (technology)
2011 CHI paper from Eric P.S. Baumer and Six Silberman.
Jul 18, 2024
Can you write a for loop in every programming language on your resume?
An amusing Hacker News comment.
Jul 17, 2024
Research paper: Multimodal, multi-class bias mitigation for predicting speaker confidence
Educational Data Mining (EDM) 2024 paper on predicting perceived speaker confidence.
Jun 21, 2024
Multi-threading for CPU inference in PyTorch
Exploring the performance characteristics of multi-threading for CPU inference with PyTorch.
May 23, 2024
Transcribing Zoom recordings with WhisperX on AWS Fargate
A Dockerized transcription and diarization pipeline using WhisperX.
May 19, 2024
Machine learning is tarmac
Fred Turner on how information technologies are like tarmac.
May 18, 2024
Interpretable machine learning with scoring models
Cynthia Rudin's research on scoring models.
May 17, 2024
Annotation, disagreement, and consensus
Collection of research related to annotator disagreement and consensus.
May 15, 2024
AWS Copilot CLI makes it easy to run containerized ETL jobs
I missed the release of AWS Copilot CLI back in 2020. It's a useful and opinionated tool.
May 13, 2024
Why can you use the pipe operator ('|') in LangChain?
LangChain lets you write chains using vertical pipes. How? By overriding __or__.
May 13, 2024
Complexity Sells
Eugene Yan argues that complexity sells. Not during AI winters!
Apr 17, 2024
Research paper: TnT-LLM
Short discussion of the 2024 paper "TnT-LLM: Text Mining at Scale with Large Language Models"
Apr 16, 2024
In-context learning: why does it work?
Why do prompting techniques based on in-context learning improve LLM performance?
Apr 11, 2024
Inference with predicted data
Inference with predicted data, including from text data.
Apr 9, 2024
Research paper: Inconsistent multiple testing corrections
Mark Rubin's position paper on statistical adjustments for multiple comparions.
Mar 21, 2024
Purposeful sampling in qualitative research
Notes on Patton's "Qualitative Research & Evaluation Methods"
Mar 13, 2024
Requirements elicitation for machine learning applications
Requirements elicitation for ML-backed designs can be challenging, through the lens of Qian Yang et al.'s CHI paper.
Mar 12, 2024
Research paper: "Red-Teaming for Generative AI: Silver Bullet or Security Theater?"
A 2024 position paper on red-teaming from Michael Feffer and others at CMU.
Mar 11, 2024
What makes a good short paper at CSCW?
Human-computer interaction and social computing contributions that make for good short papers
Mar 6, 2024
Book: Constructing Methodology for Qualitative Research
2015 book about qualitative research.
Mar 4, 2024
Interesting person: Allie Latimer
Allie Latimer founded Federally Employed Women (FEW).
Mar 1, 2024
Link Dump
Links to stuff I found interesting
Mar 1, 2024
Research paper: Label Sleuth
Shnarch et al.'s EMNLP 2022 paper "Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours"
Feb 28, 2024
Adding references to unreferenced Wikipedia articles
A February 2024 "edit-a-thon" shows the variety in unreferenced Wikipedia articles.
Feb 26, 2024
An interesting two envelopes problem
A probability question posted on Twitter.
Feb 15, 2024
Experimental design: Crowdsourced sources
Crowdsourced experimental design book recommendations.
Feb 14, 2024
"95% of people cannot solve this!"
A fun Quora post about a cruel meme.
Feb 12, 2024
Research paper: Why CSCW applications fail
A classic 1988 CSCW paper.
Feb 5, 2024
Book: Black Software
Charlton D. McIlwain's book Black Software.
Jan 31, 2024
Academic writing: avoid ambiguous antecedents
Avoid ambiguous antecedents, and a collection of academic writing tips from others.
Jan 30, 2024
My debugging process
My process for debugging software.
Jan 25, 2024
Connecting to GCP Cloud SQL from a local Docker container
Connecting to GCP Cloud SQL from a local Docker container using host.docker.internal
Jan 8, 2024
Skrub
Skrub is a cool and useful Python library.
Jan 2, 2024
Research idea: estimate the causal impact of sharing your true opinions on social media
A research/design opportunity for social media communication behaviors.
Jan 1, 2024
Research paper: A Random Sample of YouTube
McGrady et al.'s 2023 paper "Dialing for Videos: A Random Sample of YouTube"
Dec 20, 2023
The CommonLit Ease of Readability (CLEAR) Corpus
A dataset for advancing readability research.
Dec 16, 2023
NeurIPS'23 GAIED Source Round-up
Sources mentioned during the NeurIPS'23 Workshop on Generative AI for Education.
Dec 1, 2023
Converting PDFs to Markdown with Marker
A Python utility for converting PDFs to Markdown.
Dec 1, 2023
Blogroll
Blogs I find to be useful & informative.
Nov 27, 2023
Useful Links
Tutorials and other useful materials
Nov 25, 2023
Who is Zach?
Who is Zachary Levonian as a researcher and engineer?
Oct 13, 2023
Jekyll on GitHub Pages
How this blog is configured.

Other writing stuff

Conference reading lists for CHI 2022 and CSCW 2021
Short summaries of recommended human-computer interaction papers.
Are Bots Ravaging Online Encyclopedias? [code]
October 2021. Blog post written by Abby Newcomb and Sokona Mangane about our work together during the UMN REU program. (I was a research mentor.)
A Reflection on Reflective Writing Center Work [pdf]
November 2016. Journal article published in WLN: A Journal of Writing Center Scholarship, authored by Renata Fitzpatrick, Julia Kroll, and myself while I was a Lead Writing Consultant at the Carleton College Writing Center.

The text contents of this site are licensed CC BY 4.0. In other words, feel free to use or reuse them (with attribution to Zach or a link to the original post).