feat: v1.8.0 — reposition as homelab developer platform, wire inference to ultron llama-server

- Bump version to 1.8.0
- Add LLAMA_SERVER_BASE constant, point all inference calls to ultron:8081
- Update startup log to include llama-server endpoint
- Rewrite README: four pillars, cluster architecture diagram, AMD+NVIDIA RPC setup,
  layer tuning progression (7→17→30-35 t/s), full API reference, complete roadmap A-L
- Reframe project identity: knowledge accumulation platform, not chat wrapper
This commit is contained in:
2026-06-15 19:34:11 -07:00
parent 970abc8957
commit 5075a6bc55
2 changed files with 453 additions and 1 deletions

2
app.py
View File

@@ -1,6 +1,6 @@
#!/usr/bin/env python3
"""
JarvisChat - Lightweight Ollama Coding Companion
JarvisChat - Privacy-First Homelab Developer Platform
A minimal replacement for Open-WebUI that actually runs on Python 3.13
Talks to Ollama API on localhost:11434