The RAG implementation can be stretched to different parts of the services but right now starting with the simplest
This will be more about tracking the
- token usage
- latency
- increasing accuracy (goal is 0 hallucination - no answer better than wrong answer)
The RAG implementation can be stretched to different parts of the services but right now starting with the simplest
This will be more about tracking the