Quick recap on the state of language model reasoning
This is a talk I gave at the NeurIPS at the Latent Space, unofficial industry track. I wanted to directly address the question on if language models can reason and what o1 and the reinforcement finetuning API tell us about it.
You can access the slides here.
https://docs.google.com/presentation/d/1PNipMudHb5HTNnVosve0lqdrgwKSWck4SCE9TSPCJkY/edit?usp=sharing
Get Interconnects (https://www.interconnects.ai/)...
... on YouTube: https://www.youtube.com/@interconnects
... on Twitter: https://x.com/interconnectsai
... on Linkedin: https://www.linkedin.com/company/interconnects-ai
... on Spotify: https://open.spotify.com/show/2UE6s7wZC4kiXYOnWRuxGv
… on Apple Podcasts: https://podcasts.apple.com/us/podcast/interconnects/id1719552353
More information, transcript, etc: https://www.interconnects.ai/p/the-state-of-reasoning