Claude, in production
We build deeply on Anthropic’s Claude because it is a strong foundation for agentic work — long-horizon reasoning, reliable tool use, and an open integration standard in MCP. humaineeti remains model-agnostic (BYOM — bring your own model), but our default agentic stack is Claude, the Claude Agent SDK, and the Model Context Protocol.
- Agents that finish the work. Not a chatbot bolted onto a workflow — agents that research, build, evaluate, deploy, and govern, with humans in the loop where it counts.
- Adaptive thinking & effort control. Claude decides how much to reason per step; we tune the effort/cost/quality tradeoff per route rather than paying for maximum reasoning everywhere.
- Prompt caching for efficiency. Stable context is cached across turns, so long-running agents pay for what changes, not for re-reading the same preamble every request.
AI co-workers & subagents
The unit we design for is the Claude co-work — a team of Claude-powered agents that a single human directs. Each agent is accountable for its slice of the job, and human-in-the-loop guardrails sit at the decisions that matter (approvals, destructive actions, external side effects).
Subagents are how that team stays focused. A lead agent delegates independent sub-tasks to separate agents, each running in its own context window with its own tools — and only their conclusions return to the lead, so the main loop never drowns in detail it will not reuse. It is a first-class pattern across Claude’s agent tooling:
- In Claude Code, subagents run as distinct agent types — a read-only Explore agent that fans out searches, or custom agents you define under
.claude/agents/— spawned in parallel, each isolated from the others’ context. - On the Claude Agent SDK and Managed Agents, a coordinator agent delegates to a roster of agents, each running in its own context-isolated thread. Delegation is one level deep, so workers stay predictable and do not recursively spawn.
Two things make this pay off. Subagents can run on cheaper, faster Claude models — Haiku 4.5 is an excellent subagent model — so delegation also controls cost. And a fresh-context verifier subagent that reviews the lead agent’s work tends to catch more than self-review, because it judges the output without the bias of having produced it. The rule of thumb we build to: use subagents for parallel or independent workstreams, and keep tightly sequential steps in the main loop.
Agent Skills
A general-purpose agent that has to carry every specialist instruction in its context all the time is slow, expensive, and easy to confuse. Agent Skills solve that. A skill is a reusable, filesystem-based capability package — a folder with a SKILL.md — that loads on demand through progressive disclosure.
- The skill’s short description sits in context by default, so the agent knows the capability exists.
- The full instructions load only when a task actually calls for them — no context bloat, no cost paid for guidance that isn’t needed.
- The effect is a general Claude agent that becomes a specialist on your workflow the moment the work demands it, then steps back down.
humaineeti packages your recurring, well-specified workflows as Skills so they are consistent, versioned, and reusable across every agent that speaks Claude.
Custom MCP development
The Model Context Protocol (MCP) is the open standard, introduced by Anthropic, for connecting agents to tools, data, and services. An MCP server exposes its capabilities through one discoverable, model-agnostic interface — the same server works with Claude or any other MCP-aware client. Use established servers for commodity integrations; build your own for the systems that encode your business. That second half is custom MCP development, and it is where we do our deepest work.
Tools
Callable functions with typed inputs and outputs — query the CRM, post an invoice, run a domain calculation. Each one gated by scoped credentials and an allow-list.
Resources
Read-only data the model can fetch by URI — documents, records, reference tables. Treated as data, not instructions, with classification on retrieved content.
Prompts
Reusable templates the host can invoke — encoding your house style, review checklists, or extraction formats so behaviour is consistent across agents.
Build vs buy
Off-the-shelf servers for Slack, GitHub, Drive, Postgres. Custom servers for your CRM views, billing API, internal services, and domain models — where the business logic lives.
Every custom server humaineeti builds ships with scoped credentials per consumer, allow-lists on which tools an agent may call, and audit logging on every invocation — so an agent gets exactly the access the task needs, and every action is traceable.
How we build (Agent SDK + tool use)
Under the hood, our agents are built on the Claude Agent SDK and Claude’s tool use. You define your own tools; the SDK’s tool runner drives the agentic loop — calling the model, executing the tool, feeding results back, and repeating until the work is done.
- Tool runner for autonomy. For flows we want to run end-to-end, the tool runner handles the loop automatically.
- Manual loop for control. Where a step needs an approval gate or human-in-the-loop confirmation, we write the loop by hand and stop for sign-off before a hard-to-reverse action.
- Effort & adaptive thinking. We tune reasoning depth per route to balance cost and quality — higher effort where correctness matters, lower where speed does.
- Prompt caching. A frozen system prompt and deterministic tool set keep the cache warm, so long agent runs stay efficient.
Governance & evaluation
Agentic autonomy without governance is a liability, so we build the controls in from the start. Under humaineeti’s Responsible AI and Zero-Trust posture, every agent loop and every LLM invocation is traced, logged, and scored — on correctness, completeness, safety, and tool-call effectiveness.
That evidence is produced and evaluated by AI Eval Service — our Eval@Core capability — so an agent’s quality is a measured number, not a hope. It pairs naturally with our Agent Evaluations practice and the GenAI Delivery Factory for production rollout.
The Claude model line
humaineeti is model-agnostic, but when the answer is Claude, we pick the tier that fits the step — and one agent loop often uses several at once.
Claude Opus 4.8
The most capable Opus-tier model, with a 1M-token context window — our default for demanding, long-horizon agentic and knowledge work.
Claude Sonnet 4.6
The best balance of speed and intelligence — a strong workhorse for most agent steps and interactive workloads.
Claude Haiku 4.5
Fast and cost-effective for simple or high-volume work — and an excellent model to run subagents on.
Claude Fable 5
Anthropic’s most capable model, for the hardest long-horizon problems — deep reasoning and extended autonomous runs.
On-page FAQ
What is custom MCP development?
Building Model Context Protocol servers for your own systems — CRM, billing, internal APIs, domain models — rather than relying only on off-the-shelf servers. An MCP server exposes tools, resources, and prompts through one discoverable, model-agnostic interface, and humaineeti builds each with scoped credentials, allow-lists, and audit logging so an agent gets exactly the access it needs and nothing more.
What are Claude Agent Skills?
Reusable, filesystem-based capability packages — a folder with a SKILL.md — that load on demand via progressive disclosure. The skill’s short description sits in context; the full instructions load only when the task calls for it. That turns a general Claude agent into a specialist without bloating its context or cost.
What are Claude subagents?
Subagents let a lead Claude agent delegate independent sub-tasks to separate agents, each in its own context window, so only their conclusions return to the lead. In Claude Code they run as distinct agent types (a read-only Explore agent, or custom agents under .claude/agents/), spawned in parallel; on the Claude Agent SDK and Managed Agents a coordinator delegates to a roster of agents in context-isolated threads, one level deep. They can run on cheaper Claude models like Haiku 4.5 to control cost, and are best for parallel or independent workstreams rather than tightly sequential steps.
Which Claude model should we use?
It depends on the task. Claude Opus 4.8 (1M context) for demanding agentic work; Claude Sonnet 4.6 for the best speed/intelligence balance; Claude Haiku 4.5 for fast, cost-effective and subagent work; Claude Fable 5 for the hardest long-horizon problems. humaineeti is model-agnostic (BYOM) and often routes different steps of one loop to different tiers.