
It's presumably llama, just like Mistral.

Everything open source is llama now. Facebook all but standardized the architecture.

I dunno about the MoE. Is there existing transformers code for that part? It kinda looks like there is, based on the config.
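For anyone wondering what "the MoE part" would even have to do, here is a rough sketch of generic top-k expert routing in plain Python. This is not Mistral's code and not anything from the transformers library. The names `top_k_route` and `moe_forward`, the choice of k=2, and the four toy experts are all made up for illustration.

```python
import math

def top_k_route(router_logits, k=2):
    """Pick the k highest-scoring experts and softmax over just those scores.

    Hypothetical helper for illustration; k=2 is an assumption.
    """
    # Indices of the k largest router logits (ties keep the lower index first).
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Numerically stable softmax restricted to the selected experts.
    m = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - m) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

def moe_forward(x, router_logits, experts, k=2):
    """Weighted sum of the outputs of the selected experts only."""
    return sum(w * experts[i](x) for i, w in top_k_route(router_logits, k))

# Toy experts: each one just scales its input (stand-ins for feed-forward blocks).
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]

# In a real model the router logits would come from a learned linear layer.
routes = top_k_route([0.0, 5.0, 0.0, 5.0])
y = moe_forward(1.0, [0.0, 5.0, 0.0, 5.0], experts)
```

Only the selected experts run for a given token, which is why a runtime needs extra support for this layer even if the rest of the model looks like llama.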



Mistral is not llama architecture.

https://github.com/mistralai/mistral-src


It's basically llama architecture, all but drop-in compatible with llama runtimes.


Because it's JSON? :)



