Impromptu - Classical Music Discovery Chatbot

Chatbot intended to help users discover classical music.

Embabel features:

Agent-based chatbot with RAG (Neo4j vector storage)
Proposition extraction pipeline for memories about users
Spotify integration for playlist management

Getting Started

Prerequisites

API Key: Set at least one LLM provider API key as an environment variable:

# For OpenAI (GPT models)
export OPENAI_API_KEY=sk-...

# For Anthropic (Claude models)
export ANTHROPIC_API_KEY=sk-ant-...

The model configured in application.yml determines which key is required. The default configuration uses OpenAI.

Java: Java 21+ is required.

Docker: Required for running Neo4j.

Starting Neo4j

The application uses Neo4j as its vector store for RAG. Start it with Docker Compose:

docker compose up -d

This starts Neo4j with:

Bolt port: 7888 (for application connections)
HTTP port: 8889 (for Neo4j Browser at http://localhost:8889)
Credentials: neo4j / brahmsian

To stop Neo4j:

docker compose down

To wipe all data and start fresh:

docker compose down -v

Loading Open Opus Data

The application can load composer and works data from Open Opus, a free, open-source classical music database.

Load into Neo4j (with the app running):

# Load data (fetches directly from Open Opus API)
curl -X POST http://localhost:8888/api/openopus/load

# Clear all Open Opus data
curl -X DELETE http://localhost:8888/api/openopus

This creates a normalized graph with:

Composer nodes linked to Epoch (Baroque, Romantic, etc.)
Work nodes linked to Genre (Orchestral, Chamber, Keyboard, etc.)
COMPOSED relationships connecting composers to their works

Example Cypher queries after loading:

// Find all Romantic composers
MATCH (c:Composer)-[:OF_EPOCH]->(e:Epoch {name: "Romantic"})
RETURN c.completeName

// Find all orchestral works by Brahms
MATCH (c:Composer {name: "Brahms"})-[:COMPOSED]->(w:Work)-[:OF_GENRE]->(g:Genre {name: "Orchestral"})
RETURN w.title

// Count works by genre
MATCH (w:Work)-[:OF_GENRE]->(g:Genre)
RETURN g.name, count(w) as works ORDER BY works DESC

Ingesting Documents

The application can ingest documents (PDF, HTML, Markdown, etc.) into the RAG store for retrieval-augmented generation.

Ingest a URL (e.g., Project Gutenberg):

curl -X POST "http://localhost:8888/api/documents/ingest?location=https://www.gutenberg.org/files/56208/56208-h/56208-h.htm"

Ingest a local file:

curl -X POST "http://localhost:8888/api/documents/ingest?location=./data/schumann/musicandmusician001815mbp.md"

Ingest all files in a directory:

curl -X POST "http://localhost:8888/api/documents/ingest-directory?path=./data"

Check store status:

curl http://localhost:8888/api/documents/info

Supported formats: .txt, .md, .html, .htm, .pdf, .docx, .doc, .rtf, .odt

Documents are parsed using Apache Tika, which extracts hierarchical structure (headings, sections) and chunks the content for embedding. The endpoint is idempotent - documents that already exist (by URI) are skipped.

Running the Web App

After Neo4j is running:

./mvnw spring-boot:run

The app runs on port 8888 (double the 88 piano keys) at http://127.0.0.1:8888/chat

A "Neo4j Browser" link in the footer opens the database UI with credentials pre-filled.

Important: Use 127.0.0.1 (loopback address), not localhost, for OAuth to work correctly with both Google and Spotify.

Google OAuth2 Authentication

The web interface supports Google OAuth2 for user authentication. To enable it:

Go to https://console.cloud.google.com/
Create a new project or select an existing one
Navigate to APIs & Services > Credentials
Create an OAuth client ID (Web application type)
Add authorized JavaScript origins: http://127.0.0.1:8888
Add authorized redirect URIs: http://127.0.0.1:8888/login/oauth2/code/google
Set environment variables with your credentials:

export GOOGLE_CLIENT_ID="your-client-id.apps.googleusercontent.com"
export GOOGLE_CLIENT_SECRET="your-client-secret"

Without these credentials, the app falls back to anonymous user mode.

Spotify Integration (Optional)

After logging in with Google, users can link their Spotify account to enable playlist management through the chatbot.

To enable Spotify integration:

Go to https://developer.spotify.com/dashboard
Create an app (or select existing)
Add redirect URI: http://127.0.0.1:8888/callback/spotify (loopback, not localhost)
In User Management, add your Spotify email as a user (required for development mode)
Set environment variables:

export SPOTIFY_CLIENT_ID="your-spotify-client-id"
export SPOTIFY_CLIENT_SECRET="your-spotify-client-secret"

Once configured, a "Link Spotify" button appears in the header after Google login. The chatbot can then:

List your Spotify playlists
Search for tracks
Create new playlists
Add tracks to playlists

Features

Dark Concert Hall Theme: Elegant dark theme with gold accents, inspired by classical concert venues
Knowledge Base Panel: Collapsible panel showing extracted propositions from conversations
Real-time Chat: Streaming responses from the RAG-powered chatbot
User Authentication: Optional Google OAuth2 login
Spotify Integration: Link your Spotify account to create and manage playlists through the chatbot
Neo4j Browser: Direct link to explore the graph database

DICE REST API (Proposition Memory)

The application exposes DICE (Domain-Integrated Context Engineering) REST API endpoints for managing proposition-based memory. These endpoints are protected by API key authentication.

Enabling the REST Endpoints

To enable DICE REST endpoints in your application:

Import the configuration in your main application class:

import com.embabel.dice.web.rest.DiceRestConfiguration;
import org.springframework.context.annotation.Import;

@SpringBootApplication
@Import(DiceRestConfiguration.class)
public class MyApplication { }

Configure Spring Security to allow API key authentication (bypass OAuth/session auth):

@Override
public void configure(WebSecurity web) throws Exception {
    web.ignoring().requestMatchers("/api/v1/**");
    super.configure(web);
}

Add API key configuration to application.yml:

dice:
  security:
    api-key:
      enabled: true
      keys:
        - your-api-key-here

Provide a SchemaRegistry bean (see Schema Registry section below).

Authentication

All DICE endpoints require an API key header:

curl -H "X-API-Key: impromptu-admin" http://localhost:8888/api/v1/contexts/user123/memory

The default API key is impromptu-admin (configured in application.yml).

Extract Propositions from Text

# Extract propositions from text
curl -X POST http://localhost:8888/api/v1/contexts/alice_default/extract \
  -H "Content-Type: application/json" \
  -H "X-API-Key: impromptu-admin" \
  -d '{
    "text": "Johann Sebastian Bach composed the Brandenburg Concertos in 1721. He was born in Eisenach, Germany.",
    "sourceId": "music-facts"
  }'

Extract with User Association

Use knownEntities to associate extracted propositions with a user or other entities:

curl -X POST http://localhost:8888/api/v1/contexts/alice_default/extract \
  -H "Content-Type: application/json" \
  -H "X-API-Key: impromptu-admin" \
  -d '{
    "text": "I really enjoyed the Brahms Symphony No. 4 performance last night.",
    "sourceId": "user-conversation",
    "knownEntities": [
      {
        "id": "alice_id",
        "name": "Alice",
        "type": "User",
        "description": "A music enthusiast who loves classical music",
        "role": "The user in the conversation"
      }
    ]
  }'

The knownEntities array accepts entities with:

id - Unique identifier for the entity
name - Display name
type - Entity type label (e.g., "User", "Composer", "Work")
description - Optional description of the entity (defaults to name if not provided)
role - Descriptive role explaining context (e.g., "The user in the conversation", "A referenced composer")

Extract with Named Schema

If multiple schemas are registered, specify which one to use:

curl -X POST http://localhost:8888/api/v1/contexts/alice_default/extract \
  -H "Content-Type: application/json" \
  -H "X-API-Key: impromptu-admin" \
  -d '{
    "text": "The contract was signed on January 15, 2024.",
    "schemaName": "legal"
  }'

Extract Propositions from File

# Upload and process a document (PDF, Word, Markdown, HTML)
curl -X POST http://localhost:8888/api/v1/contexts/alice_default/extract/file \
  -H "X-API-Key: impromptu-admin" \
  -F "file=@./data/schumann/musicandmusician001815mbp.md" \
  -F "sourceId=schumann-writings"

# With schema name
curl -X POST http://localhost:8888/api/v1/contexts/alice_default/extract/file \
  -H "X-API-Key: impromptu-admin" \
  -F "file=@./document.pdf" \
  -F "schemaName=legal"

Query Memory

# Get all propositions for a context
curl -H "X-API-Key: impromptu-admin" \
  http://localhost:8888/api/v1/contexts/alice_default/memory

# Search by similarity
curl -X POST http://localhost:8888/api/v1/contexts/user123/memory/search \
  -H "Content-Type: application/json" \
  -H "X-API-Key: impromptu-admin" \
  -d '{
    "query": "What instruments did Bach play?",
    "topK": 5,
    "similarityThreshold": 0.7
  }'

# Get propositions about a specific entity
curl -H "X-API-Key: impromptu-admin" \
  "http://localhost:8888/api/v1/contexts/user123/memory/entity/bach-123"

Manage Propositions

# Get a specific proposition
curl -H "X-API-Key: impromptu-admin" \
  http://localhost:8888/api/v1/contexts/alice_default/memory/prop-456

# Delete a proposition (soft delete)
curl -X DELETE -H "X-API-Key: impromptu-admin" \
  http://localhost:8888/api/v1/contexts/alice_default/memory/prop-456

Configuration

The DICE API key security is configured in application.yml:

dice:
  security:
    api-key:
      enabled: true
      keys:
        - impromptu-admin          # Add your API keys here
      headerName: X-API-Key        # Optional, defaults to X-API-Key
      pathPatterns:                # Optional, defaults to /api/v1/**
        - /api/v1/**

Custom API Key Authenticator

For production, implement a custom ApiKeyAuthenticator bean to validate keys against a database or secrets manager. When you provide your own bean, the in-memory authenticator is automatically disabled:

@Component
public class DatabaseApiKeyAuthenticator implements ApiKeyAuthenticator {

    private final ApiKeyRepository apiKeyRepository;

    public DatabaseApiKeyAuthenticator(ApiKeyRepository apiKeyRepository) {
        this.apiKeyRepository = apiKeyRepository;
    }

    @Override
    public AuthResult authenticate(String apiKey) {
        return apiKeyRepository.findByKey(apiKey)
            .map(key -> new AuthResult.Authorized(
                key.getClientName(),
                Map.of("scopes", key.getScopes())
            ))
            .orElseGet(() -> new AuthResult.Unauthorized("Invalid API key"));
    }
}

The AuthResult.Authorized can include a principal name and metadata map, which are stored in request attributes for downstream use:

dice.auth.principal - The authenticated client identifier
dice.auth.metadata - Additional metadata (scopes, rate limits, etc.)

Schema Registry

To support multiple named schemas, provide a SchemaRegistry bean:

@Bean
SchemaRegistry schemaRegistry(DataDictionary defaultSchema) {
    InMemorySchemaRegistry registry = InMemorySchemaRegistry.withDefault(defaultSchema);
    registry.register("music", DataDictionary.fromClasses(Composer.class, Work.class));
    registry.register("legal", DataDictionary.fromClasses(Contract.class, Party.class));
    return registry;
}

If no SchemaRegistry bean is provided, wrap your default DataDictionary with InMemorySchemaRegistry.withDefault(schema).

Implementation Details

Neo4j Vector Storage

The application uses Neo4j as its vector store for RAG, configured via application.yml:

database:
  datasources:
    neo:
      type: NEO4J
      host: ${NEO4J_HOST:localhost}
      port: ${NEO4J_PORT:7888}
      user-name: ${NEO4J_USERNAME:neo4j}
      password: ${NEO4J_PASSWORD:brahmsian}
      database-name: ${NEO4J_DATABASE:neo4j}

neo4j:
  http:
    port: ${NEO4J_HTTP_PORT:8889}

Key aspects:

Neo4j with vector indexes: Chunks are stored as nodes with vector embeddings for similarity search
Graph relationships: Content relationships can be modeled as edges in the graph
Persistent storage: Data survives container restarts (stored in Docker volume)
Configurable chunking: Content is split into chunks with configurable size (default 800 chars) and overlap (default 100 chars)
Admin queries: See cypher/admin.cypher for useful queries to inspect and manage the database

Chunking properties can be configured via application.yml:

impromptu:
  neo-rag:
    max-chunk-size: 800
    overlap-size: 100

Chatbot Creation

The chatbot is created in ChatConfiguration.java:

@Bean
Chatbot chatbot(AgentPlatform agentPlatform) {
    return AgentProcessChatbot.utilityFromPlatform(agentPlatform);
}

The AgentProcessChatbot.utilityFromPlatform() method creates a chatbot that automatically discovers all @Action methods in @EmbabelComponent classes. Any action with a matching trigger becomes eligible to be called when appropriate messages arrive.

Action Handling

Chat actions are defined in ChatActions.java:

@EmbabelComponent
public class ChatActions {

    private final ToolishRag toolishRag;
    private final ImpromptuProperties properties;
    private final SpotifyService spotifyService;

    public ChatActions(
            SearchOperations searchOperations,
            SpotifyService spotifyService,
            ApplicationEventPublisher eventPublisher,
            ImpromptuProperties properties) {
        this.toolishRag = new ToolishRag(
                "sources",
                "The music criticism written by Robert Schumann: His own writings",
                searchOperations)
                .withHint(TryHyDE.usingConversationContext());
        this.spotifyService = spotifyService;
        this.properties = properties;
    }

    @Action(canRerun = true, trigger = UserMessage.class)
    void respond(Conversation conversation, ImpromptuUser user, ActionContext context) {
        List<Object> tools = new LinkedList<>();
        if (user.isSpotifyLinked()) {
            tools.add(new SpotifyTools(user, spotifyService));
        }
        var assistantMessage = context.ai()
                .withLlm(properties.chatLlm())
                .withPromptContributor(user)
                .withReference(toolishRag)
                .withToolObjects(tools)
                .withTemplate("ragbot")
                .respondWithSystemPrompt(conversation, Map.of(
                        "properties", properties,
                        "user", user
                ));
        context.sendMessage(conversation.addMessage(assistantMessage));
    }
}

Key concepts:

@EmbabelComponent: Marks the class as containing agent actions that can be discovered by the platform
@Action annotation:
- trigger = UserMessage.class: This action is invoked whenever a UserMessage is received in the conversation
- canRerun = true: The action can be executed multiple times (for each user message)
ToolishRag as LLM reference:
- Wraps the SearchOperations (Neo4j vector store) as a tool the LLM can use
- When .withReference(toolishRag) is called, the LLM can search the RAG store to find relevant content
- The LLM decides when to use this tool based on the user's question
Spotify tools: When the user has linked their Spotify account, SpotifyTools is added as a tool object, enabling playlist management

Prompt Templates

Chatbot prompts are managed using Jinja templates rather than inline strings. This is best practice for chatbots because:

Prompts grow complex: Chatbots require detailed system prompts covering persona, guardrails, objectives, and behavior guidelines
Separation of concerns: Prompt engineering can evolve independently from Java code
Reusability: Common elements (guardrails, personas) can be shared across different chatbot configurations
Configuration-driven: Switch personas or objectives via application.yml without code changes

Separating Voice from Objective

The template system separates two concerns:

Objective: What the chatbot should accomplish - the task-specific instructions and domain expertise
Voice: How the chatbot should communicate - the persona, tone, and style of responses

This separation allows mixing and matching. You could have a "music" objective answered in the voice of Shakespeare or a different persona without duplicating instructions.

Template Structure

src/main/resources/prompts/
├── ragbot.jinja                    # Main template entry point
├── elements/
│   ├── guardrails.jinja            # Safety and content restrictions
│   └── personalization.jinja       # Dynamic persona/objective loader
├── personas/                       # HOW to communicate (voice/style)
│   ├── impromptu.jinja             # Default: friendly music guide
│   ├── shakespeare.jinja           # Elizabethan style
│   ├── bible.jinja                 # Biblical style
│   ├── adaptive.jinja              # Adapts to user
│   └── jesse.jinja                 # Casual style
└── objectives/                     # WHAT to accomplish (task/domain)
    ├── music.jinja                 # Classical music education (default)
    └── legal.jinja                 # Legal document analysis

How Templates Are Loaded

The main template ragbot.jinja composes the system prompt from reusable elements:

{% include "elements/guardrails.jinja" %}

{% include "elements/personalization.jinja" %}

Keep your responses under {{ properties.voice().maxWords() }} words unless they
MUST be longer for a detailed response or quoting content.

The personalization.jinja template dynamically includes persona and objective based on configuration:

{% set persona_template = "personas/" ~ voice.persona() ~ ".jinja" %}
{% include persona_template %}

{% set objective_template = "objectives/" ~ objective ~ ".jinja" %}
{% include objective_template %}

Configuration Reference

All configuration is externalized in application.yml, allowing behavior changes without code modifications.

application.yml Reference

database:
  datasources:
    neo:
      host: localhost
      port: 7888               # Neo4j Bolt port
      user-name: neo4j
      password: brahmsian

neo4j:
  http:
    port: 8889                 # Neo4j Browser HTTP port

impromptu:
  # RAG chunking settings
  neo-rag:
    max-chunk-size: 800        # Maximum characters per chunk
    overlap-size: 100          # Overlap between chunks for context continuity

  # LLM model selection and hyperparameters
  chat-llm:
    model: gpt-4.1-mini        # Model to use for chat responses
    temperature: 0.0           # 0.0 = deterministic, higher = more creative

  # Voice controls HOW the chatbot communicates
  voice:
    persona: impromptu         # Which persona template to use (personas/*.jinja)
    max-words: 250             # Hint for response length

  # Objective controls WHAT the chatbot accomplishes
  objective: music             # Which objective template to use (objectives/*.jinja)

embabel:
  models:
    default-llm:
      model: gpt-4.1-mini
    default-embedding-model:
      model: text-embedding-3-small

Switching Personas

To change the chatbot's personality, simply update the persona value:

impromptu:
  voice:
    persona: shakespeare     # Now responds in Elizabethan English

To use a different LLM:

impromptu:
  chat-llm:
    model: gpt-4.1           # Use the larger GPT-4.1 instead
    temperature: 0.7         # More creative responses

No code changes required - just restart the application.

Miscellaneous

Killing a Stuck Server Process

If your IDE dies or the server doesn't shut down cleanly, you may need to manually kill the process on port 8888:

lsof -ti :8888 | xargs kill -9

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
.github		.github
.mvn/wrapper		.mvn/wrapper
.vscode		.vscode
cypher		cypher
data		data
docs		docs
images		images
scripts		scripts
src		src
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml
vite.config.ts		vite.config.ts

License

embabel/impromptu

Folders and files

Latest commit

History

Repository files navigation