Hi there! Congrats on the great work, really appreciate seeing discussions like these in the open ✨
Just one question: in the long-context extension phase you mention using an extra 100B tokens - where do you source them from? Are they drawn from the same sources as pretraining, just with different upsampling weights?
In general, I would really appreciate any resources or pointers on what data to use for long-context extension!