Papers I Liked 2024

This has been another year where I felt like I slacked on my reading, and that probably is genuinely true for the tumultuous last half, but my read folder lists 154, so I can pick out a few that I liked to highlight in this end-of-year tradition. These are in approximate chronological order by date read, and as always I make no claim to depth or representativeness, so even in areas I follow there’s a lot on my should-read list. Topics shifted during the year with major career changes, from LLMs and RL to more classic econometrics to a bit of computation and even some computational game theory, to applied economics. In the past year I’ve gone through full-time work at an AI startup, two academic job searches (one still ongoing: please hire me! I’m very willing to move abroad), Covid and other health issues, homelessness (I was fine, don’t worry about it), a visiting liberal arts teaching load with a new prep on short notice, and, um, a new gender, and still managed to get out two papers I’m proud of, with more ongoing. Since I am in self-promotion mode until someone gives me a job, my papers first:

  • Zhou, Huang, Azizzadenesheli, Childers, and Lipton. “Timing as an Action: Learning When to Observe and Act” AISTATS 2024
    • We derived reinforcement learning algorithms for what economists will recognize as the sticky information setting! This incorporates periods of non-observation but turns out to be easier than a POMDP because the learner has the option to observe at any time, for a cost. Led by a dream team of Helen Zhou (experiments, healthcare applications) and Audrey Huang (badass RL theorist).
  • Gupta, Lipton, and Childers. “Online Data Collection for Efficient Semiparametric Inference
    • Extends our previous work on Online Moment Selection for combining data sources adaptively via a kind of Bandit Generalized Method of Moments to the semiparametric/double-machine learning case with nonparametric nuisance parameters. Think adaptive experiments, but for a much broader class of possibly-observational settings and models. This also builds on some fun technical work on uniform-in-time confidence sequences.

Papers I liked

Related