Workshop Schedule 2025 · Saturday, November 8

09:00-09:15 Opening Remarks
09:15-10:15 BabyLM Challenge Orals
CLASS-IT: Conversational and Lecture-Aligned Small-Scale Instruction Tuning for BabyLMs Luca Capone, Alessandro Bondielli, Alessandro Lenci

Masked Diffusion Language Models with Frequency-Informed Training Despoina Kosmopoulou, Efthymios Georgiou, Vaggelis Dorovatas, Georgios Paraskevopoulos, Alexandros Potamianos

MoEP: Modular Expert Paths for Sample-Efficient Language Modeling Joonas Tapaninaho

Mask and You Shall Receive: Optimizing Masked Language Modeling for Pre-training BabyLMs Lukas Edman, Alexander Fraser

Once Upon a Time: Interactive Learning for Storytelling with Small Language Models Jonas Mayer Martins, Ali Hamza Bashir, Muhammad Rehan Khalid, Lisa Beinborn
10:15-11:00 Break
11:00-12:00 Invited Talk 1: Hai Hu - Benchmarking Baby and Large Language Models in Chinese

Abstract

In this talk, I will first briefly overview the efforts in creating linguistically oriented benchmarks in Chinese. I will discuss the design and construction of benchmarks targeting the orthography, phonology, syntax, logic and semantics, pragmatics and world knowledge of modern and classical Chinese, by our lab and other teams in the field. The evaluations of baby and large language models show that current LLMs are very powerful, especially with the addition of “reasoning” abilities. However, certain linguistic blind spots remain, and further refinement of evaluation tasks and methodologies is needed. Next, I will discuss ongoing studies in understanding the learning mechanisms of monolingual and bilingual language models involving Chinese. Finally, I will point out what the LM community might learn from the language acquisition community.

12:00-13:30 Lunch
13:30-15:00 Poster Session
15:00-15:30 Break
15:30-16:30 Invited Talk 2: Yohei Oseki - Small Language Models through Insights from Human Language Acquisition

Abstract

Large language models (LLMs) have achieved remarkable success, thanks to the rapid development of AI and machine/deep learning, and outperformed humans at various downstream tasks. However, those LLMs, despite their super-human performance, have been pointed out as not efficient in terms of training data, model parameters, and computational resources. In this talk, I propose small language models (SLMs) that efficiently learn natural language like humans, building on insights from human language acquisition. Specifically, SLMs are trained on developmentally plausible corpora like BabyLM Challenge via curriculum learning, batch learning, direct/indirect evidence, variation set, and critical period. The results suggest that inductive biases are essential to efficiently train SLMs, with scientific implications for human language acquisition, as well as engineering applications to edge AI and low-resource languages.

16:30-17:05 BabyLM Workshop Orals
Teacher Demonstrations in a BabyLM’s Zone of Proximal Development for Contingent Multi-Turn Interaction Suchir Salhan, Hongyi Gu, Donya Rooein, Diana Galvan-Sosa, Gabrielle Gaudeau, Andrew Caines, Zheng Yuan, Paula Buttery

Are BabyLMs Deaf to Gricean Maxims? A Pragmatic Evaluation of Sample-Efficient Language Models Raha Askari, Sina Zarrieß, Özge Alacam, Judith Sieker

Looking to Learn: Token-wise Dynamic Gating for Low-Resource Vision-Language Modelling Bianca-Mihaela Ganescu, Suchir Salhan, Andrew Caines, Paula Buttery
17:05-17:15 Awards and Closing Remarks

Overview • Workshop Schedule • Posters • Guidelines • Timeline • FAQs • Previous papers

Workshop Schedule 2025 · Saturday, November 8