Publications

(2024). Efficient Parallelization Layouts for Large-Scale Distributed Model Training. In COLM.

PDF

(2024). NextLevelBERT: Masked Language Modeling with Higher-Level Representations for Long Documents. In ACL.

PDF

(2024). CommitBench: A Benchmark for Commit Message Generation. In SANER 2024.

PDF Code Dataset

(2023). GorillaVision - Open-Set Re-Identification of Wild Gorillas. In Third International Workshop on Camera Traps, Artificial Intelligence, and Ecology, Jena, Germany.

PDF Code