[OpenClaw 2/8] Install and configure Ollama on Hermes VPS #725

Closed
opened 2026-03-21 13:57:23 +00:00 by perplexity · 1 comment

Description

Install Ollama on the Hermes VPS and pull a small, capable model for agentic tool-calling.

Tasks

  1. Install Ollama — Follow official install (curl -fsSL https://ollama.com/install.sh | sh)
  2. Pull a small model — Based on Kimi's research spike, pull the recommended model (likely Qwen 2.5 7B Q4_K_M or similar)
  3. Test basic inference — Verify the model responds to prompts
  4. Test tool-calling — Send a tool-calling prompt via the OpenAI-compatible API (http://localhost:11434/v1/chat/completions) and verify it produces valid tool calls
  5. Benchmark — Time a typical response, check RAM usage, and confirm there is no disk thrashing
  6. Configure as service — Ensure Ollama starts on boot (systemctl enable ollama)
  7. Set context window — Configure for a 64K+ context window if the model supports it
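A minimal sketch of the install and service-setup tasks above. The exact model tag is an assumption pending Kimi's research spike, and the context-window mechanism depends on the installed Ollama version (recent releases read an `OLLAMA_CONTEXT_LENGTH` environment variable; older ones need a Modelfile with `PARAMETER num_ctx`):

```shell
# Install Ollama via the official script
curl -fsSL https://ollama.com/install.sh | sh

# Pull a 4-bit quantized small model (tag assumed; adjust per research spike)
ollama pull qwen2.5:7b-instruct-q4_K_M

# Quick smoke test of basic inference
ollama run qwen2.5:7b-instruct-q4_K_M "Reply with the single word: pong"

# Start on boot and start now
sudo systemctl enable --now ollama

# Raise the context window via a systemd drop-in; under [Service] add:
#   Environment="OLLAMA_CONTEXT_LENGTH=65536"
sudo systemctl edit ollama
sudo systemctl restart ollama
```

Note this is an ops fragment to adapt on the VPS, not a turnkey script; verify the variable name against the running Ollama version before relying on it.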

Acceptance Criteria

  • Ollama running on VPS with a working model
  • Model can produce valid tool-calling JSON responses
  • RAM usage is sustainable (no OOM, no excessive swap)
  • Ollama starts automatically on reboot
  • Response latency is documented
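The tool-calling criterion can be checked with a small script against the OpenAI-compatible endpoint. This is a sketch under assumptions: the `get_weather` tool, the helper names, and the model tag are invented for illustration; only the endpoint URL and response shape follow the OpenAI chat-completions convention Ollama exposes. The parsing logic is exercised here against a canned response so it runs without a live server:

```python
import json

# Endpoint from the issue; hypothetical model tag.
OLLAMA_URL = "http://localhost:11434/v1/chat/completions"
MODEL = "qwen2.5:7b-instruct-q4_K_M"

def build_request(model: str, prompt: str) -> dict:
    """Assemble a chat-completion payload with one example tool definition."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "tools": [{
            "type": "function",
            "function": {
                "name": "get_weather",  # invented tool, for illustration only
                "description": "Get current weather for a city",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }],
    }

def extract_tool_calls(response: dict) -> list[dict]:
    """Pull (name, parsed-arguments) pairs out of a chat-completion response."""
    calls = response["choices"][0]["message"].get("tool_calls", [])
    return [
        {"name": c["function"]["name"],
         "args": json.loads(c["function"]["arguments"])}
        for c in calls
    ]

# Canned response shaped like an OpenAI-style tool-calling reply, so the
# extraction logic can be verified offline.
sample = {
    "choices": [{
        "message": {
            "role": "assistant",
            "tool_calls": [{
                "id": "call_0",
                "type": "function",
                "function": {
                    "name": "get_weather",
                    "arguments": "{\"city\": \"Berlin\"}",
                },
            }],
        },
    }],
}

parsed = extract_tool_calls(sample)
print(parsed)
```

To run the real check, POST `build_request(...)` to `OLLAMA_URL` (e.g. with `requests.post`) and pass the JSON body to `extract_tool_calls`; the criterion passes if the arguments parse as valid JSON matching the tool schema.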

Depends on

  • VPS audit issue (Phase 1, issue above)
  • Kimi research: best small LLMs for tool-calling

Constraints

  • Do NOT install a model larger than 8B params
  • Prefer 4-bit quantization to save RAM
  • If the VPS can't handle it, escalate to Alex with findings
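For the benchmark task and the RAM constraint, a small helper can record the latency numbers the acceptance criteria ask for. This is a hypothetical sketch: the workload below is a stand-in `sleep`, to be replaced with a real request to `http://localhost:11434/v1/chat/completions` when benchmarking on the VPS. `MemAvailable` is read from `/proc/meminfo`, so the RAM check is Linux-only:

```python
import time
import statistics

def time_calls(fn, n=5):
    """Call fn n times; return the median latency in seconds plus all samples."""
    samples = []
    for _ in range(n):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples), samples

def available_ram_mib():
    """Read MemAvailable from /proc/meminfo (Linux only), in MiB."""
    with open("/proc/meminfo") as f:
        for line in f:
            if line.startswith("MemAvailable:"):
                return int(line.split()[1]) // 1024
    raise RuntimeError("MemAvailable not found")

# Stand-in workload; swap in a real chat-completion request on the VPS.
median, samples = time_calls(lambda: time.sleep(0.01), n=3)

try:
    ram = available_ram_mib()
except (FileNotFoundError, RuntimeError):
    ram = None  # non-Linux host

print(f"median latency: {median:.3f}s, free RAM: {ram} MiB")
```

Capturing `available_ram_mib()` before, during, and after a run gives a rough view of whether the model fits without pushing the box into swap.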

Parent epic: rockachopa/Timmy-time-dashboard#663


Migrated from perplexity/the-matrix#116

kimi was assigned by Timmy 2026-03-21 18:02:07 +00:00
kimi added this to the OpenClaw Sovereignty milestone 2026-03-21 20:24:22 +00:00
claude added the rejected-direction label 2026-03-23 13:51:16 +00:00

🧹 Closed — Rejected Direction (OpenClaw)

OpenClaw direction was explicitly rejected by the principal. The harness is the product — sovereign AI runs on Hermes with local models, not OpenClaw.

Ref: Deep Backlog Triage #1076. Reopen if needed.


Reference: Rockachopa/Timmy-time-dashboard#725