RAG implementation for an LLM-based support system

The RAG implementation can later be extended to other parts of the service, but for now it starts with the simplest use case:
- Merchant support system
  - Refund policy queries

The initial focus is on tracking and improving:
- token usage
- latency
- accuracy (the goal is zero hallucinations: no answer is better than a wrong answer)
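
The three concerns above (token usage, latency, and preferring no answer over a wrong one) could be sketched roughly as follows. This is a minimal illustration, not the project's implementation: `answer_with_metrics`, `QueryMetrics`, `count_tokens`, the `retrieve` and `generate` callables, and the `min_score` threshold are all hypothetical names, and the whitespace token count is only a stand-in for whatever tokenizer the chosen model uses.

```python
import time
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical fallback reply used when retrieval finds nothing relevant.
NO_ANSWER = "I could not find this in the refund policy."


@dataclass
class QueryMetrics:
    prompt_tokens: int
    completion_tokens: int
    latency_ms: float
    answered: bool


def count_tokens(text: str) -> int:
    # Assumption: a rough whitespace count; swap in the model's real tokenizer.
    return len(text.split())


def answer_with_metrics(
    question: str,
    retrieve: Callable[[str], List[Tuple[str, float]]],
    generate: Callable[[str], str],
    min_score: float = 0.5,
) -> Tuple[str, QueryMetrics]:
    """Answer a question from retrieved passages and record per-query metrics.

    `retrieve` returns (passage, relevance_score) pairs and `generate` wraps
    the LLM call; both are placeholders supplied by the caller.
    """
    start = time.perf_counter()
    passages = retrieve(question)
    relevant = [text for text, score in passages if score >= min_score]

    if not relevant:
        # Abstain instead of letting the model guess without grounding.
        latency_ms = (time.perf_counter() - start) * 1000
        return NO_ANSWER, QueryMetrics(0, 0, latency_ms, answered=False)

    prompt = "\n".join(relevant) + "\n\nQuestion: " + question
    answer = generate(prompt)
    latency_ms = (time.perf_counter() - start) * 1000
    metrics = QueryMetrics(
        prompt_tokens=count_tokens(prompt),
        completion_tokens=count_tokens(answer),
        latency_ms=latency_ms,
        answered=True,
    )
    return answer, metrics
```

Logging one `QueryMetrics` record per request would give a baseline for token usage and latency, and the share of records with `answered=False` shows how often the system abstains. The abstention threshold only guards against answering without relevant context; it does not by itself guarantee that a generated answer is correct.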