Emergent Exploration

Exploration has always been a hot topic in RL research, and for good reason. It can help agents discover primitive skills that can later be reused for more complex tasks, and it can help avoid local ...

Dec 29, 2025 Artificial, Intelligence

Alignment Research Accelerates AI Progress

Alignment research, especially research on practical alignment methods for current AI systems, contributes significantly to AI progress, and will do so even more in the future. For instance,...

Oct 31, 2025 Artificial, Intelligence

Rethinking Verifiability

I want to share some thoughts on verifiability, inspired by recent works like AlphaEvolve and RLVR. I first clarify that there are several different scenarios in which verifiability manifests, and ...

Oct 11, 2025 Artificial, Intelligence

Noether's Theorem and Quasi-Symmetry

Noether’s theorem is a fundamental result in physics that relates symmetries to conservation laws. The usual statement of Noether’s theorem requires the action to be invariant under a continuous tr...

Oct 10, 2025 Physics

Writing Something is Better than Writing Nothing

When starting this blog, I thought I would be writing a bit more than I am doing right now. What happened? One reason is that it’s not as rewarding as I initially thought. For instance, I used to t...

May 22, 2025 Personal, Thoughts

D'Alembert's Principle and the Principle of Least Action

One of the most famous examples for introducing Lagrangian mechanics is deriving the dynamics of a pendulum. I was never taught d’Alembert’s principle, which is a crucial part of the derivation. T...

Dec 12, 2024 Physics

A Theory of Unsupervised Learning

Note (Dec 5, 2025): I do have some original thoughts in the “Some additional thoughts” section, but as for the rest, you should just watch Marcus Hutter’s one-hour lecture if you want to understand Il...

Nov 1, 2024 Artificial, Intelligence
