MyAgent

MyAgent is an intelligent multi-agent system built with Spring Boot that combines LLM (Large Language Model) capabilities, web search, OCR (Optical Character Recognition), and Redis caching to answer user queries. It handles both text queries and image-based queries, extracts the relevant text from images, and returns AI-generated responses.

Features

AI-Powered Responses: Uses Google Gemini (via LangChain4j) to generate answers to user queries.
Web Search Integration: Automatically fetches and summarizes web results for queries that require up-to-date information.
Image Text Extraction: Accepts image URLs, extracts text using an external OCR API, and uses the extracted text as part of the query.
Caching with Redis: Stores query-response pairs in Redis for fast retrieval and reduced API usage.
REST API: Exposes endpoints for user interaction.
Swagger UI: Provides interactive API documentation.

Architecture Overview

UserQueryController: Handles incoming user requests, orchestrates image download, text extraction, and query processing.
AgentManager: Central orchestrator that decides whether to use cached responses, perform web search, or call the AI model.
MemoryAgent: Manages Redis caching for queries and responses.
WebSearchAgent: Integrates with Google Custom Search API to fetch web results.
AiResponseAgent: Connects to Google Gemini via LangChain4j for LLM-powered responses.
ImageExtractionAPI: Calls an external OCR API to extract text from images.

How It Works

User submits a query (optionally with an image URL) via the /user/ask endpoint.
If an image is provided, the system downloads it and extracts text using OCR.
The extracted text (if any) is combined with the user's query.
The system checks Redis for a cached response.
If there is no cached response, the system checks whether the query needs a web search (for example, because it contains keywords such as "latest" or "news") and, if so, fetches web results.
The processed query is sent to the Gemini LLM for a response.
The response is cached in Redis and returned to the user.
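
The steps above can be sketched in plain Java. This is a minimal illustration, not the project's actual code: the class name AgentFlowSketch, its method names, the in-memory map standing in for Redis, the prompt layout, and the two-word keyword list are all assumptions made for the example. The web search and LLM calls are passed in as functions so the flow can be read without any external service.

```java
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

/** Illustrative sketch of the request flow; every name here is hypothetical. */
final class AgentFlowSketch {
    // Assumed trigger words; the README only names "latest" and "news".
    private static final List<String> SEARCH_KEYWORDS = List.of("latest", "news");

    // In-memory stand-in for the Redis cache (does not accept null values).
    private final Map<String, String> cache = new ConcurrentHashMap<>();
    private final Function<String, String> webSearch; // stand-in for WebSearchAgent
    private final Function<String, String> llm;       // stand-in for AiResponseAgent

    AgentFlowSketch(Function<String, String> webSearch, Function<String, String> llm) {
        this.webSearch = webSearch;
        this.llm = llm;
    }

    /** Combines the user's query with OCR text, skipping whichever part is empty. */
    static String combine(String query, String extractedText) {
        String q = query == null ? "" : query.trim();
        String t = extractedText == null ? "" : extractedText.trim();
        if (t.isEmpty()) return q;
        if (q.isEmpty()) return t;
        return q + "\n" + t;
    }

    /** Keyword heuristic: does the query look like it needs fresh information? */
    static boolean needsWebSearch(String query) {
        String lower = query.toLowerCase(Locale.ROOT);
        return SEARCH_KEYWORDS.stream().anyMatch(lower::contains);
    }

    /** Cache lookup, optional web search, LLM call, then cache store. */
    String answer(String query) {
        String cached = cache.get(query);
        if (cached != null) {
            return cached; // cache hit: no search, no LLM call
        }
        String prompt = needsWebSearch(query)
                ? query + "\n\nWeb results:\n" + webSearch.apply(query)
                : query;
        String response = llm.apply(prompt);
        cache.put(query, response);
        return response;
    }
}
```

Asking the same question twice on one instance calls the llm function only once; the second call is served from the map, which mirrors the role Redis plays in the real system.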

API Endpoints

`POST /user/ask`

Request Body:

{
  "query": "What is the latest news about AI?",
  "image": "https://example.com/image.png" // optional
}

Response:
Returns the AI-generated answer, possibly including information extracted from the image and/or web search.

`GET /`

Redirects to Swagger UI for API exploration.

Configuration

Set the following environment variables (or define the corresponding values in your application.properties or application.yml):

GEMINI_API_KEY - API key for Google Gemini.
WEB_SEARCH_API_KEY - API key for Google Custom Search.
WEB_SEARCH_ID - Search engine ID for Google Custom Search.
OCR_USERNAME - Username for the OCR API.
OCR_LICENCE - License code for the OCR API.
OCR_URL - Endpoint URL for the OCR API.
REDIS_HOST - Redis server hostname.
REDIS_PORT - Redis server port.
REDIS_PASSWORD - Redis server password.
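
A quick pre-flight check can catch a missing variable before the application starts. The sketch below is a standalone helper, not part of the project: the class name ConfigCheckSketch is hypothetical, and treating all nine variables as mandatory is an assumption (for example, a Redis instance without authentication may not need REDIS_PASSWORD). It only covers the environment-variable route, not values defined in application properties.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Hypothetical helper that reports which documented variables are unset. */
final class ConfigCheckSketch {
    // Variable names as listed in this README; "required" is an assumption.
    static final List<String> REQUIRED = List.of(
            "GEMINI_API_KEY", "WEB_SEARCH_API_KEY", "WEB_SEARCH_ID",
            "OCR_USERNAME", "OCR_LICENCE", "OCR_URL",
            "REDIS_HOST", "REDIS_PORT", "REDIS_PASSWORD");

    /** Returns the names that are absent or blank in the given environment. */
    static List<String> missing(Map<String, String> env) {
        List<String> absent = new ArrayList<>();
        for (String name : REQUIRED) {
            String value = env.get(name);
            if (value == null || value.isBlank()) {
                absent.add(name);
            }
        }
        return absent;
    }

    public static void main(String[] args) {
        List<String> absent = missing(System.getenv());
        if (!absent.isEmpty()) {
            System.err.println("Missing configuration: " + String.join(", ", absent));
            System.exit(1);
        }
        System.out.println("All documented variables are set.");
    }
}
```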

Running the Project

Prerequisites

Java 21+
Maven
Redis server running and accessible

Build and Run

# Build the project
./mvnw clean package

# Run the application
java -jar target/MyAgent-0.0.1-SNAPSHOT.jar

Or use Docker:

docker build -t myagent .
docker run -e ... -p 8080:8080 myagent

Technologies Used

Spring Boot
LangChain4j (Google Gemini integration)
Google Custom Search API
Redis (Jedis client)
Lombok
WebFlux (WebClient)
Jackson (JSON processing)
Swagger/OpenAPI

Example Usage

Ask a question:
POST /user/ask with { "query": "Summarize the latest AI trends" }
Extract text from an image and ask:
POST /user/ask with { "image": "https://example.com/text-image.png" }
Combine both:
POST /user/ask with { "query": "What does this say?", "image": "https://example.com/text-image.png" }
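
The same requests can be sent from Java with the JDK's built-in HTTP client. This is a sketch under assumptions: the class name AskClientSketch is hypothetical, the base URL http://localhost:8080 assumes the default port mapping from the Docker command, and the Content-Type header and plain-string response handling are guesses about what the endpoint expects and returns. The small JSON builder avoids any third-party dependency.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

/** Hypothetical client for POST /user/ask using java.net.http (Java 11+). */
final class AskClientSketch {

    /** Wraps a string in quotes and escapes characters JSON does not allow raw. */
    static String quote(String s) {
        StringBuilder sb = new StringBuilder("\"");
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"': sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.append('"').toString();
    }

    /** Builds the request body; either argument may be null to omit that field. */
    static String body(String query, String imageUrl) {
        StringBuilder sb = new StringBuilder("{");
        if (query != null) sb.append("\"query\":").append(quote(query));
        if (imageUrl != null) {
            if (query != null) sb.append(',');
            sb.append("\"image\":").append(quote(imageUrl));
        }
        return sb.append('}').toString();
    }

    /** Builds (but does not send) the POST request. */
    static HttpRequest request(String baseUrl, String query, String imageUrl) {
        return HttpRequest.newBuilder(URI.create(baseUrl + "/user/ask"))
                .header("Content-Type", "application/json") // assumed header
                .POST(HttpRequest.BodyPublishers.ofString(body(query, imageUrl)))
                .build();
    }

    public static void main(String[] args) throws Exception {
        HttpRequest req = request("http://localhost:8080",
                "What does this say?", "https://example.com/text-image.png");
        HttpResponse<String> resp =
                HttpClient.newHttpClient().send(req, HttpResponse.BodyHandlers.ofString());
        System.out.println(resp.statusCode() + " " + resp.body());
    }
}
```

Passing null for either argument reproduces the query-only and image-only variants listed above.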

License

This project is licensed under the Apache License 2.0.

Note:
You must provide valid API keys and configure Redis for the application.
