Andrea Santilli

PhD Student in Computer Science

GLADIA, Sapienza University of Rome

Biography

Hi! I’m Andrea Santilli, a third-year Ph.D. student in Computer Science at GLADIA, Sapienza University of Rome, supervised by Prof. Emanuele Rodolà.

My main research interests lie at the intersection of natural language processing, representation learning, and machine intelligence. In particular, I have explored topics such as syntax in transformers, instruction-tuned LMs, audio LMs, multimodal neural databases, and efficient decoding techniques (check out my publications!).

I’m an enthusiast of Open Source and Open Science initiatives. I have contributed to Hugging Face’s BigScience, and whenever possible I contribute to open-source projects (GitHub profile). If you are interested in what I’m doing and would like to contact me, you can send me a DM on Twitter or use the contact form below!

Interests
  • Natural Language Processing
  • Representation Learning
  • Machine Intelligence
Education
  • PhD in Artificial Intelligence

    Sapienza University of Rome

  • MSc in Computer Science, 2020

    University of Roma Tor Vergata

  • BSc in Computer Science, 2018

    University of Roma Tor Vergata

Publications

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their …

Research Experience

Hugging Face BigScience
Open Science Researcher
Jun 2021 – Jun 2022 Remote
Researcher at Hugging Face’s workshop on large language models. Worked in the prompt-engineering working group on zero-shot generalization. Resulted in two publications.

ART Lab, University of Roma Tor Vergata
University Research Assistant
Jan 2018 – Jun 2020 Rome
Worked on syntax in deep neural network models and BERT-based NLP models.

Pi School, School of Artificial Intelligence
Research Engineer
Oct 2019 – Dec 2019 Rome
Worked on a European Commission project to promote entrepreneurship and tech transfer in the R&D area (“Started Project”) via NLP-based tools.

Mashfrog Group
NLP Research Engineer
Jul 2019 – Oct 2019 Rome
Research and development of NLP models (grammar error correction, language generation) for a web-based text editor for press releases.

Grants Awarded

Our project on efficient Machine Translation (MT) was selected as the winner of the category ‘Machine Learning Algorithms For Translation’ among proposals submitted by world experts and professors (7% acceptance rate). We developed a novel decoding algorithm that speeds up autoregressive transformers by up to 2x and published the results at ACL 2023. PI: Andrea Santilli. Budget: 20.000€
ufi
Multimodal Artificial Intelligence for 3D shape analysis, modeling and applications
Joint project on multimodal 3D and NLP applications between our research group GLADIA at Sapienza and Maks Ovsjanikov’s group at Ecole Polytechnique. PI: Simone Melzi, Maks Ovsjanikov. Budget: 10.000€

Contact