Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

My own experience. I'm working on something complex that's not in the datasets these models were trained on. There I see V4 flash breaking down and hallucinating much more often than GPT/Claude. For normal, common tasks, I also don't see much of a difference.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: