🤝 Open to Collab

Michal Valko

misovalko

https://misovalko.github.io/

AI & ML interests

large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models

Recent Activity

liked a dataset 17 days ago

ulamai/verified-research-reasoning-trajectories

Viewer • Updated May 18 • 12 • 145 • 1

authored a paper about 1 month ago

Spectral bandits for smooth graph functions with applications in recommender systems

Paper • 2605.20552 • Published May 19

updated a dataset about 2 months ago

misovalko/my-research-papers

Updated May 21 • 24

authored 11 papers 2 months ago

Optimal last-iterate convergence in matrix games with bandit feedback using the log-barrier

Paper • 2604.15242 • Published Apr 16

Best of both worlds: Stochastic & adversarial best-arm identification

Paper • 2604.14860 • Published Apr 16

Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning

Paper • 2604.14974 • Published Apr 16

The Harder Path: Last Iterate Convergence for Uncoupled Learning in Zero-Sum Games with Bandit Feedback

Paper • 2604.16087 • Published Apr 17

Adaptive multi-fidelity optimization with fast learning rates

Paper • 2604.16239 • Published Apr 17

Sample Complexity Bounds for Stochastic Shortest Path with a Generative Model

Paper • 2604.16111 • Published Apr 17

authored 6 papers 3 months ago

Spectral Thompson sampling

Paper • 2604.13739 • Published Apr 15

Covariance-adapting algorithm for semi-bandits with application to sparse rewards

Paper • 2604.13738 • Published Apr 15

Online learning with noisy side observations

Paper • 2604.13740 • Published Apr 15

Online Semi-Supervised Learning on Quantized Graphs

Paper • 1203.3522 • Published Mar 15, 2012

Derivative-Free & Order-Robust Optimisation

Paper • 1910.04034 • Published Oct 22, 2019

A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption

Paper • 1810.00997 • Published Feb 23, 2019
