Must Read papers

I am working on summarizing and highlighting the key points of each paper listed in this section individually. Stay tuned for updates to this page!

Core Papers

Other Evals-Related literature

Benchmarks for Evals

Benchmark Papers

Science of Evals

Other Benchmark and Evaluation Papers

Software

Core

Other

Miscellaneous

Core

Other

Red Teaming Litereature

Core

Other

Scalable Oversight

Core

Other

Scaling Laws & Emergent Behaviors

Core

Other

Science Tutorials

Core

Other

LLM Capabilities

Core

Other

LLM Steering - RLHF

Core

Other

Supervised Fine-Tuning & Prompting

Core

Other

Fairness, Bias, and Accountability

AI Governance

Core

Other