publications

2023

  1. beyond-scale.png
    Beyond Scale: the Diversity Coefficient as a Data Quality Metric Demonstrates LLMs are Pre-trained on Formally Diverse Data
    Alycia Lee, Brando Miranda, and Sanmi Koyejo
    ICML Workshop on Data-centric Machine Learning Research and Workshop on Deployable Generative AI, 2023

2020

  1. signal-peptides.png
    Signal Peptides Generated by Attention-Based Neural Networks
    Zachary Wu, Kevin K. Yang, Michael J. Liszka, Alycia Lee, Alina Batzilla, David Wernick, David P. Weiner, and Frances H. Arnold
    ACS Synthetic Biology, 2020

2019

  1. stochasticBO.png
    Batched Stochastic Bayesian Optimization via Combinatorial Constraints Design
    Kevin K. Yang, Yuxin Chen, Alycia Lee, and Yisong Yue
    In Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics, 16–18 apr 2019