- Change 'no API dependency' to 'API fallback available for large models' - Update package tagline to 'local inference capability' Closes #907
5.9 KiB
Service Offerings
Who We Are
Whitestone Engineering is a human-led, AI-augmented engineering firm. Our principal engineer directs a fleet of five autonomous AI agents that build, test, and ship production infrastructure around the clock. We deliver at the speed and consistency of a 10-person team with the overhead of one.
Tier 1: Agent Infrastructure
For companies that want autonomous AI agents working for them.
What We Build
- Custom AI agent deployment using our battle-tested Hermes framework
- Multi-agent fleet orchestration with persistent memory and skills systems
- MCP (Model Context Protocol) server development and integration
- Local LLM inference stacks (Ollama, vLLM, custom model serving)
- Agent-to-agent communication networks
- Cron-scheduled autonomous workflows (burn-mode operations)
- Multi-platform agent gateways (Telegram, Discord, Slack, custom)
- Self-hosted code forge setup with full CI/CD integration
Pricing
- Hourly: $400 — $600/hr
- Project: $15,000 — $25,000+
- Retainer: $8,000 — $15,000/month
Ideal Client
- AI startups building agent products
- Companies wanting to deploy internal AI workforce
- Organizations needing sovereign (self-hosted) AI infrastructure
- Teams that want agents integrated into their existing toolchain
Deliverables Include
- Deployed agent system with documentation
- Monitoring and health check dashboards
- Runbook for operations and troubleshooting
- 30 days of post-deployment support
Tier 2: Security & Hardening
For companies that already have AI systems and need them locked down.
What We Build
- AI agent security audits (jailbreak resistance, prompt injection, data exfiltration)
- Conscience validation systems (ethical guardrails that actually work)
- Crisis detection and automated response pipelines
- CVE-class vulnerability identification and remediation
- Secure agent communication protocols
- Audit logging and compliance frameworks
- Red-teaming exercises against existing AI deployments
Pricing
- Hourly: $250 — $400/hr
- Project: $8,000 — $15,000
- Retainer: $5,000 — $10,000/month
Ideal Client
- Companies deploying customer-facing AI agents
- Regulated industries (finance, healthcare) using LLMs
- Organizations that have had AI safety incidents
- AI companies preparing for SOC 2 or similar compliance
Deliverables Include
- Security assessment report with severity ratings
- Remediation implementation (not just a report — we fix it)
- Jailbreak resistance test suite
- Ongoing monitoring recommendations
- Optional: retained security review as systems evolve
Tier 3: Automation & Research
For companies that need things built, automated, or investigated.
What We Build
- Webhook-driven CI/CD pipelines
- Automated data processing and ETL workflows
- Intelligence reports and research synthesis
- Custom tooling and scripts
- Infrastructure automation (Ansible, Terraform, shell)
- API integrations and middleware
- Technical due diligence reports
- Proof-of-concept development
Pricing
- Hourly: $150 — $250/hr
- Project: $5,000 — $10,000
- Retainer: $3,000 — $5,000/month
Ideal Client
- Startups that need a "get it done" engineering partner
- VCs needing technical due diligence on portfolio companies
- Companies drowning in manual processes
- Research teams that need technical implementation support
Deliverables Include
- Working automation/pipeline with documentation
- Source code in client's repository
- Handoff documentation for internal team
- 14 days of post-delivery support
Package Deals
Starter — $5,000
Get your first AI agent working for you.
- Single Hermes agent deployment
- Basic automation workflow (cron-scheduled tasks)
- One platform integration (Telegram, Discord, or Slack)
- Basic monitoring and alerting
- Documentation and runbook
- 14 days post-deployment support
Timeline: 1-2 weeks
Professional — $15,000
A multi-agent fleet that operates autonomously.
- Up to 3 Hermes agent instances
- Fleet coordination and task routing
- Multi-platform gateway (2+ platforms)
- Persistent memory and skills system
- Monitoring dashboard with health checks
- Webhook-driven automation pipelines
- Comprehensive documentation
- 30 days post-deployment support
Timeline: 3-4 weeks
Enterprise — $40,000+
Full sovereign infrastructure with local inference capability.
- Full agent fleet (5+ instances)
- Local LLM inference stack (API fallback available for large models)
- Self-hosted code forge (Gitea) with CI/CD
- Agent security hardening and conscience validation
- Nostr-based sovereign communication layer
- Custom agent skills development
- Burn-mode autonomous operation cycles
- Full test suite and quality assurance
- Dedicated support channel
- 90 days post-deployment support
- Priority response SLA
Timeline: 6-8 weeks
How We Work
- Discovery Call (30 min, free) — We learn about your problem
- Proposal (1-2 business days) — Detailed scope, timeline, and pricing
- Kickoff (Day 1) — 50% deposit, project begins immediately
- Delivery — Fleet builds, human reviews, client receives updates
- Handoff — Documentation, training, and support period begins
- Ongoing (optional) — Retained relationship for continued development
Why Us vs. Traditional Consultancies
| Factor | Traditional | Whitestone Engineering |
|---|---|---|
| Team size | Must hire/staff up | Fleet is always ready |
| Hours/day | 8 | 24 (agents don't sleep) |
| Ramp-up time | Weeks | Days |
| Consistency | Varies by person | Systematic and reproducible |
| AI expertise | Learning it | Built the infrastructure |
| Overhead | Office, HR, benefits | Lean and efficient |
| Cost | $300-500/hr billed | Competitive, transparent |
All prices are in USD. Custom scoping available for complex engagements. Volume discounts for multi-project commitments.