Explicit Web Search Button — 🔍 button next to SEND forces a web search, bypassing model uncertainty detection
Orange Search Styling — Search results, WEB badge, and search button share consistent orange color scheme
Expanded Refusal Patterns — Added "As an AI model", "based on my training data", "I don't have the capability"
Code cleanup — Removed unused JSONResponse import and dead raw_results_md variable
Bug fixes — Replaced bare except clauses with except Exception; corrected add_memory() return type to int | None; updated TemplateResponse call to Starlette's current API signature

What's New in v1.4.0

FTS5 Memory System: Say "remember that..." to store facts — they're automatically retrieved by relevance and injected into context
Forget Command: Say "forget about..." to remove memories
Memory Toggle: Enable/disable memory injection from topbar or settings
Multi-file Structure: Backend and frontend separated for easier maintenance

Features

Persistent Memory — SQLite FTS5 full-text search for fast, relevant memory retrieval
Web Search — SearXNG integration for automatic web lookups when the model is uncertain
Explicit Search — 🔍 button to force web search without waiting for model uncertainty
Profile Injection — Custom system prompt injected into every conversation
System Presets — Save and switch between different system prompts
Real-time Stats — CPU, RAM, GPU, VRAM monitoring in sidebar
Token Thermometer — Visual context window usage indicator
Streaming Responses — Server-sent events for real-time token display
Conversation History — SQLite-backed chat persistence with mass-delete option
Model Switching — Change Ollama models on the fly

Current WiP (Prioritized)

Canonical backlog: docs/wiki/current-wip.md

Scope boundary: local-first (same-host Ollama), optional RFC1918 LAN endpoints, no public Internet AI endpoints by default.

Total identified items: 26

Top 10 (brief):

P0: Add auth for write/admin endpoints
P0: Add CSRF/origin protection for state-changing requests
P0: Block unsafe URL schemes in rendered links
P0: Add rate limiting and request size limits
P1: Restrict /api/settings updates to allowlisted keys
P1: Add pagination + hard caps for list APIs
P1: Replace raw exception leakage with safe client errors
P1: Add automated tests for streaming/search/memory paths
P2: Implement MCP-style skills/tool-call framework
P2: Implement heartbeat/check-in scheduler + summary endpoint

Item 1 executive summary: keep guest mode for conversational chat, require 4-digit admin PIN for advanced/destructive actions, and enforce local/LAN-only backend policy by default.

Implementation status: complete (guest session by default + admin unlock + admin-only write enforcement + origin checks + audit logging + capability tests).

TODO

~~Verify SearXNG and Docker services persist across reboots~~
Conversation search/filter by keyword
Export conversation to markdown/text
Keyboard shortcuts (Ctrl+N new chat, Ctrl+Enter send)
Retry button on assistant messages
Source links — clickable links when search used
Allow conversation renaming
Multiple profiles — coding/sysadmin/general
Auto-generate conversation tags (client-side KWIC, top 5, filterable badges)
Image input support — pull vision model, file input/drag-drop, base64 encode, pass images array to Ollama /api/chat
Split-screen option for btop display
Skills as markdown files — /opt/jarvischat/skills/, YAML frontmatter + instructions, injected into context for tool calls
Heartbeats / proactive check-ins — cron + endpoint for daily briefings, HA anomaly alerts
Model info button — (i) icon next to Model dropdown, shows div with model description, last updated date, best-use purpose
Set default model — toggle any model as the default selection
Hide/remove model from list — exclude models from dropdown
Update model function — trigger ollama pull for selected model from UI
Add mouseover tooltip to SEND button

File Structure

/opt/jarvischat/
├── app.py              # FastAPI backend
├── jarvischat.db       # SQLite database (auto-created)
├── static/
│   └── logo.png        # Logo image (optional)
└── templates/
    └── index.html      # Frontend

Requirements

Python 3.11+ (tested on 3.13)
Ollama running locally or on network
SearXNG (optional, for web search)

Installation

Fresh Install

# Create directory and venv
sudo mkdir -p /opt/jarvischat
sudo chown $USER:$USER /opt/jarvischat
cd /opt/jarvischat
python3 -m venv venv

# Install dependencies
./venv/bin/pip install fastapi uvicorn httpx psutil jinja2 python-multipart

# Set admin PIN before first startup (4 digits)
export JARVISCHAT_ADMIN_PIN=4827

# Create subdirectories
mkdir -p templates static

# Copy files
# (copy app.py to /opt/jarvischat/)
# (copy index.html to /opt/jarvischat/templates/)
# (copy logo.png to /opt/jarvischat/static/ — optional)

WARNING: Do not use 1234 as your admin PIN unless you accept weak local security.

NOTE: First boot now requires JARVISCHAT_ADMIN_PIN unless you explicitly opt into insecure fallback with JARVISCHAT_ALLOW_DEFAULT_PIN=true.

Upgrading from v1.4.x

cd /opt/jarvischat

# Backup
cp app.py app.py.bak
cp templates/index.html templates/index.html.bak

# Copy new files
# (copy app.py, replacing old version)
# (copy index.html to templates/)

# Restart
sudo systemctl restart jarvischat

Systemd Service

Create /etc/systemd/system/jarvischat.service:

[Unit]
Description=JarvisChat - Local Ollama Web Interface
After=network.target

[Service]
Type=simple
User=jarvischat
Group=jarvischat
WorkingDirectory=/opt/jarvischat
ExecStart=/opt/jarvischat/venv/bin/uvicorn app:app --host 0.0.0.0 --port 8080
Restart=always
RestartSec=5

[Install]
WantedBy=multi-user.target

sudo systemctl daemon-reload
sudo systemctl enable jarvischat
sudo systemctl start jarvischat

Memory Commands

In chat, natural language triggers memory operations:

You say	What happens
"remember that I prefer Rust over Go"	Stores as `preference`
"remember that JarvisChat runs on port 8080"	Stores as `infrastructure`
"note that the deadline is Friday"	Stores as `general`
"forget about the deadline"	Removes matching memories

Memories are automatically searched based on your message content and injected into the system prompt when relevant.

Memory Topics

Memories are auto-categorized:

preference — likes, dislikes, choices
project — active work, repos, tasks
infrastructure — servers, services, configs
personal — name, location, background
general — everything else

API Endpoints

Memory

Method	Endpoint	Description
GET	`/api/memories`	List all memories
POST	`/api/memories`	Add memory `{"fact": "...", "topic": "general"}`
DELETE	`/api/memories/{rowid}`	Delete memory by ID
GET	`/api/memories/search?q=term`	Search memories
GET	`/api/memories/stats`	Get counts by topic

Chat & Models

Method	Endpoint	Description
GET	`/api/models`	List available Ollama models
POST	`/api/chat`	Send message (streaming SSE)
POST	`/api/search`	Explicit web search (streaming SSE)
POST	`/api/show`	Get model info (context size)
GET	`/api/ps`	Get running models

Settings & Profile

Method	Endpoint	Description
GET	`/api/profile`	Get profile content
PUT	`/api/profile`	Update profile
GET	`/api/profile/default`	Get default profile
GET	`/api/settings`	Get settings
PUT	`/api/settings`	Update settings

Conversations

Method	Endpoint	Description
GET	`/api/conversations`	List conversations
GET	`/api/conversations/{id}`	Get conversation with messages
DELETE	`/api/conversations/{id}`	Delete conversation
DELETE	`/api/conversations`	Delete ALL conversations

Presets

Method	Endpoint	Description
GET	`/api/presets`	List presets
POST	`/api/presets`	Create preset
PUT	`/api/presets/{id}`	Update preset
DELETE	`/api/presets/{id}`	Delete preset

System

Method	Endpoint	Description
GET	`/api/stats`	CPU, RAM, GPU, VRAM stats
GET	`/api/search/status`	SearXNG availability

Configuration

Settings are stored in the settings table and include:

profile_enabled — Inject profile into chats (true/false)
search_enabled — Auto web search (true/false)
memory_enabled — Memory injection (true/false)
default_model — Default Ollama model
searxng_url — SearXNG instance URL (default: http://localhost:8888)

Testing Memory

# Add a memory via API
curl -X POST http://jarvis:8080/api/memories \
  -H "Content-Type: application/json" \
  -d '{"fact": "User prefers native installs over Docker", "topic": "preference"}'

# Search memories
curl "http://jarvis:8080/api/memories/search?q=docker"

# List all memories
curl http://jarvis:8080/api/memories

# Get stats
curl http://jarvis:8080/api/memories/stats

Or in chat:

Say "remember that I hate YAML"
Later ask "what markup languages should I avoid?"
JarvisChat will inject the YAML preference into context

Troubleshooting

Service won't start

Check logs:

journalctl -u jarvischat -n 50 --no-pager

Common issues:

Missing jinja2: ./venv/bin/pip install jinja2
Missing templates/ directory
Wrong permissions on /opt/jarvischat

Memory not working

Check memory is enabled (🧠 MEM ON in topbar)
Verify memories exist: curl http://jarvis:8080/api/memories
Check FTS5 table: sqlite3 jarvischat.db "SELECT * FROM memories_fts;"

Web search not working

Verify SearXNG is running: curl http://localhost:8888/search?q=test&format=json
Check search status: curl http://jarvis:8080/api/search/status
Ensure JSON format is enabled in SearXNG settings

License

MIT

Repository

Gitea: ssh://gitea@llgit.llamachile.tube:1319/gramps/jarvisChat.git