I’ve been exploring MariaDB 11.8’s new vector search capabilities for building AI-driven applications, particularly pairing it with local LLMs for retrieval-augmented generation (RAG) over fully private data that never leaves the machine. I’m curious how others in the community are leveraging these features in their projects.

I’m especially interested in using it with local LLMs (like Llama or Mistral) to keep data on-premise and avoid cloud-based API costs or security concerns.

Does anyone have experiences to share? In particular, which LLMs (or embedding models) are you using to generate the embeddings you store in MariaDB?
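For context, this is roughly the workflow I have in mind (a minimal sketch, assuming Ollama serving nomic-embed-text for the embeddings and the mariadb Python connector; the table, column, and credential names are just placeholders):

```python
# Minimal sketch: embed text with a local model via Ollama and store/query it
# in a MariaDB 11.8 VECTOR column. Credentials and names are placeholders.
import mariadb  # pip install mariadb
import ollama   # pip install ollama

conn = mariadb.connect(host="localhost", user="rag", password="rag", database="rag_demo")
cur = conn.cursor()

# MariaDB 11.8: native VECTOR column plus a vector index for ANN search.
cur.execute("""
    CREATE TABLE IF NOT EXISTS docs (
        id INT PRIMARY KEY AUTO_INCREMENT,
        content TEXT NOT NULL,
        embedding VECTOR(768) NOT NULL,
        VECTOR INDEX (embedding)
    )
""")

def embed(text: str) -> str:
    """Embed text with a local model and format it for VEC_FromText()."""
    vec = ollama.embeddings(model="nomic-embed-text", prompt=text)["embedding"]
    return "[" + ",".join(f"{x:.6f}" for x in vec) + "]"

# Store a chunk together with its embedding.
chunk = "MariaDB 11.8 ships native vector search."
cur.execute(
    "INSERT INTO docs (content, embedding) VALUES (?, VEC_FromText(?))",
    (chunk, embed(chunk)),
)
conn.commit()

# Pull the chunks closest to a question, by cosine distance, to feed the RAG prompt.
cur.execute(
    "SELECT content FROM docs "
    "ORDER BY VEC_DISTANCE_COSINE(embedding, VEC_FromText(?)) LIMIT 3",
    (embed("How does vector search work in MariaDB?"),),
)
for (content,) in cur:
    print(content)
```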

    • andrew0@lemmy.dbzer0.com · 2 hours ago

      You’re right! Sorry for the typo. The older nomic-embed-text model is often used in examples, but granite-embedding is more recent and smaller, targeting English-only text (30M parameters). If your use case is multilingual, they also offer a bigger variant (278M parameters) that handles English, German, Spanish, French, Japanese, Portuguese, Arabic, Czech, Italian, Korean, Dutch, and Chinese (Simplified). I would test them out a bit to see what works best for you.
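      A quick side-by-side is usually enough to get a feel for them (a rough sketch, assuming all three models are pulled into a local Ollama instance; the model tags and sample texts are just illustrative):

      ```python
      # Rough comparison of candidate embedding models; the tags are the ones
      # listed on ollama.com and may differ in your setup.
      import numpy as np
      import ollama

      MODELS = ["nomic-embed-text", "granite-embedding", "granite-embedding:278m"]

      query = "How do I create a vector index in MariaDB?"
      docs = [
          "MariaDB 11.8 adds a VECTOR column type and ANN indexes.",
          "Qdrant is a standalone vector database with REST and gRPC APIs.",
          "Las bases de datos vectoriales permiten busquedas semanticas.",  # Spanish, to probe multilingual models
      ]

      def cosine(a, b):
          a, b = np.asarray(a), np.asarray(b)
          return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

      # Print each model's similarity between the query and every sample document.
      for model in MODELS:
          q = ollama.embeddings(model=model, prompt=query)["embedding"]
          scores = [
              cosine(q, ollama.embeddings(model=model, prompt=d)["embedding"])
              for d in docs
          ]
          print(model, [round(s, 3) for s in scores])
      ```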

      Furthermore, if you’re not dependent on MariaDB for something else in your system, there are other vector databases worth considering. Qdrant works quite well, and you can integrate it pretty easily into something like LangChain (rough sketch below). It really depends on how far you want to push your RAG workflow, but let me know if you have any other questions.
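      If you go that route, the wiring looks roughly like this (a sketch, assuming a local Qdrant instance on the default port, Ollama for embeddings, and the langchain-qdrant / langchain-ollama packages; the collection name and sample texts are made up):

      ```python
      # Qdrant + LangChain sketch; collection name and texts are placeholders.
      from langchain_ollama import OllamaEmbeddings   # pip install langchain-ollama
      from langchain_qdrant import QdrantVectorStore  # pip install langchain-qdrant

      embeddings = OllamaEmbeddings(model="nomic-embed-text")

      # Build a collection from raw text chunks on a local Qdrant server.
      store = QdrantVectorStore.from_texts(
          texts=[
              "MariaDB 11.8 adds a VECTOR column type and ANN indexes.",
              "Qdrant stores vectors in named collections.",
          ],
          embedding=embeddings,
          url="http://localhost:6333",
          collection_name="rag_demo",
      )

      # Retrieve the chunks most similar to a question.
      for doc in store.similarity_search("vector search in MariaDB", k=2):
          print(doc.page_content)
      ```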