# CloudThinker

> AI-powered multi-agent platform for autonomous cloud operations — cost optimization, code review, and incident response across AWS, Azure, and GCP

<img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/unified-graph.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=cf565be92cc4f6760ba1dfb8b2d3f2ec" alt="CloudThinker Agentic Platform — modules and capabilities unified" style={{width: '100%', height: 'auto', background: '#0a0a0a', borderRadius: '12px', padding: '16px'}} width="800" height="800" data-path="images/platform/unified-graph.svg" />

AI agents that manage your infrastructure, review code, resolve incidents, and optimize costs across multiple clouds, Kubernetes, and everything in between. The result is autonomous, self-healing infrastructure.

<CardGroup cols={2}>
  <Card title="Quickstart" icon="rocket" href="/quickstart">
    Connect your cloud and run your first operation in 5 minutes
  </Card>

  <Card title="CloudThinker Language" icon="code" href="/guide/language">
    Master the `@agent` and `#tool` syntax for effective prompting
  </Card>
</CardGroup>

***

## Why Cloud Operations Is Broken

Cloud teams juggle 8–12 separate platforms for cost, security, monitoring, and operations. When a service goes down at 3 AM, engineers spend more time correlating dashboards than fixing the problem. Cost reviews happen monthly. Security audits take weeks. And every new cloud service adds another point tool to manage.

**Three compounding problems:**

| Problem            | Reality                                                                                                                                    |
| ------------------ | ------------------------------------------------------------------------------------------------------------------------------------------ |
| **Too many tools** | Cost Explorer for spend, Security Hub for compliance, Datadog for monitoring, kubectl for containers — and none of them talk to each other |
| **Too slow**       | Manual root-cause analysis takes hours; incidents stay open while engineers piece together the story across six dashboards                 |
| **Too hard**       | Deep cloud expertise required just to ask basic questions — a permanent blocker for developers and non-specialists                         |

***

## The CloudThinker Difference

Most cloud management tools are dashboards — they show you data, but you still have to interpret it, decide what to do, and manually execute changes.

CloudThinker acts.

| Capability            | Traditional Tools                   | CloudThinker                                                                                         |
| --------------------- | ----------------------------------- | ---------------------------------------------------------------------------------------------------- |
| **Cost visibility**   | Show historical spend charts        | Identify waste, recommend right-sizing, implement with approval                                      |
| **Security posture**  | Report misconfiguration counts      | Investigate context, prioritize risk, generate remediation plans                                     |
| **Incident response** | Page you and show metric graphs     | Suppress 98% of noise upstream, then investigate root cause autonomously before you open your laptop |
| **Compliance**        | Generate point-in-time scores       | Produce evidence, close gaps, maintain continuous compliance                                         |
| **Interface**         | Console + CLI + separate dashboards | Natural language across all domains, any cloud                                                       |

Five specialized AI agents replace the fragmented tool stack — and they coordinate with each other on complex problems.

***

## Core Modules

<CardGroup cols={3}>
  <Card title="Code Review" icon="code-pull-request" href="/guide/code-review/overview">
    AI-powered PR reviews with security analysis — 96% accuracy
  </Card>

  <Card title="Deep Response Engine" icon="triangle-exclamation" href="/guide/incident/overview">
    Pulse cuts the noise, AI investigates the rest, with MTTR as low as 4m 32s
  </Card>

  <Card title="HelpDesk" icon="headset">
    Tiered support with intelligent escalation — 70% auto-resolved
  </Card>

  <Card title="Infra Ops" icon="server" href="/guide/infrastructure/resources">
    Unified cost, security, and performance — 325+ operations
  </Card>

  <Card title="CloudKeeper" icon="radar" href="/guide/infrastructure/cloudkeepers">
    24/7 autonomous monitoring and compliance
  </Card>

  <Card title="SlackOps" icon="slack" href="/guide/slack-integration">
    Conversational ops directly in Slack
  </Card>
</CardGroup>

<Tabs>
  <Tab title="Code Review">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/code-review.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=ebab376f4bad2681b9a32ef2d0c16c79" alt="Code review module with AI-powered analysis and security scanning" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/code-review.svg" />
      </div>
    </Frame>

    AI agents review every pull request for bugs, security vulnerabilities, and best-practice violations — then post actionable feedback directly in your PR. Security analysis runs automatically alongside code quality checks, catching issues before they reach production.
  </Tab>

  <Tab title="Deep Response Engine">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/incident.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=f0ea5b75936bf34d72c1b45e16168331" alt="Deep Response Engine — Pulse signal intelligence feeding AI-powered incident investigation" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/incident.svg" />
      </div>
    </Frame>

    From cloud event to resolution, end to end. **Pulse** ingests 10+ sources and cuts \~98% of noise through seven suppression layers and AI classification. **Incident** investigates the moment a cluster escalates — forming hypotheses, testing them against evidence, and executing remediation runbooks under your approval gates. MTTR as low as **4m 32s**.
  </Tab>

  <Tab title="HelpDesk">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/helpdesk.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=e7d8083f3675652ede2b771fef12a46e" alt="HelpDesk module with tiered support and intelligent escalation" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/helpdesk.svg" />
      </div>
    </Frame>

    A multi-tier support system where AI agents handle routine requests while intelligently escalating complex issues to the right specialist. **70% of tickets are auto-resolved** without human intervention.
  </Tab>

  <Tab title="Infra Ops">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/infrastructure.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=6c5dfcb0c5f685afa81bc06252b2ae52" alt="Infrastructure operations module covering cost, security, and performance" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/infrastructure.svg" />
      </div>
    </Frame>

    Three pillars in one unified interface: **Cost** optimization to eliminate waste, **Security** posture management to stay compliant, and **Performance** monitoring to keep systems healthy. Agents continuously analyze and recommend improvements across all three.
  </Tab>

  <Tab title="CloudKeeper">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/cloudkeeper.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=92d82cfdb9b1652cd31f741600d8b0c3" alt="CloudKeeper module providing 24/7 autonomous monitoring and compliance" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/cloudkeeper.svg" />
      </div>
    </Frame>

    Always-on autonomous agents that continuously scan your cloud environment for drift, compliance violations, and optimization opportunities. [CloudKeepers](/guide/infrastructure/cloudkeepers) work around the clock — surfacing issues before they become incidents.
  </Tab>

  <Tab title="SlackOps">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/slackops.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=efe3cfdca79abbb893d0df3829c62dbd" alt="SlackOps module enabling conversational operations in Slack" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/slackops.svg" />
      </div>
    </Frame>

    Run cloud operations directly from Slack through [SlackOps](/guide/slack-integration), a native conversational UI. Mention an agent, describe what you need, and get results, all without leaving your team's workspace.
  </Tab>
</Tabs>

***

## Platform Capabilities

The foundational layer that powers every module — composable, secure, and observable.

<CardGroup cols={2}>
  <Card title="Skills" icon="puzzle-piece">
    Composable skill definitions (SKILL.md, tools, prompts, guardrails, triggers)
  </Card>

  <Card title="Autonomy" icon="gauge-high">
    4-level autonomy: Notify, Suggest, Approve, Autonomous — RBAC-gated
  </Card>

  <Card title="Sandbox" icon="box">
    Ephemeral microVM environments with per-tenant VPC isolation
  </Card>

  <Card title="Runbook" icon="list-check">
    325+ pre-built operations with cron scheduling and chaining
  </Card>

  <Card title="Connections" icon="plug" href="/guide/connections/overview">
    MCP-based integrations — AWS, Azure, GCP, K8s, Slack, GitHub, Datadog, and more
  </Card>

  <Card title="Knowledge" icon="book" href="/guide/knowledge">
    Vectorized RAG knowledge base with continuous learning
  </Card>

  <Card title="Topology" icon="diagram-project" href="/guide/infrastructure/topology">
    Real-time resource mapping and dependency graphs across regions
  </Card>

  <Card title="Memory" icon="brain">
    Multi-layer memory: episodic, working, semantic, and file storage
  </Card>

  <Card title="Guardrails" icon="shield-halved">
    PII detection, schema validation, and prompt injection defense
  </Card>

  <Card title="Observability" icon="chart-line">
    OpenTelemetry tracing, LLM-as-Judge evaluation, dashboards
  </Card>
</CardGroup>

<Tabs>
  <Tab title="Skills">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/skills.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=cb19d1a22db423a0754c7839c293eead" alt="Skills architecture showing SKILL.md definitions, tools, and base components" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/skills.svg" />
      </div>
    </Frame>

    Every agent capability is defined as a composable **Skill** — a combination of SKILL.md definitions, tool bindings, prompts, guardrails, triggers, and schemas. Skills are the building blocks that make agents specialized.
  </Tab>

  <Tab title="Autonomy">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/autonomy.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=3bd03ad9e2b3d23cf36c5695abda8403" alt="Four autonomy levels from L1 Notify to L4 Autonomous" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/autonomy.svg" />
      </div>
    </Frame>

    Control how independently agents operate: **L1 Notify** (report only), **L2 Suggest** (recommend actions), **L3 Approve** (act with approval), **L4 Autonomous** (full self-directed execution). Each level is gated by RBAC policies.
  </Tab>

  <Tab title="Sandbox">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/sandbox.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=d5a4b8ca52bc8fbadf54c94fe37f9206" alt="Sandbox environment with microVM isolation and auto-destroy" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/sandbox.svg" />
      </div>
    </Frame>

    Agents execute in **ephemeral, isolated sandbox environments** — microVMs with per-tenant VPC isolation that auto-destroy after use. Agent actions never touch your production infrastructure directly.
  </Tab>

  <Tab title="Runbook">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/runbook.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=43659ce34b394eebd51a5475c4716bc1" alt="Runbook system with 325+ operations, cron scheduling, and chaining" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/runbook.svg" />
      </div>
    </Frame>

    Pre-built operational procedures that agents execute step by step. **325+ operations** are available; they can be scheduled via cron, chained into complex workflows, and customized for your environment.
  </Tab>

  <Tab title="Connections">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/connections.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=26d8d857715dab0e24b85eb9c2af588f" alt="Connections hub integrating AWS, Azure, Kubernetes, Slack, GitHub, and more" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/connections.svg" />
      </div>
    </Frame>

    An **[MCP](/guide/connections/mcp)-based integrations hub** connecting agents to your infrastructure. AWS, Azure, GCP, Kubernetes, Slack, GitHub, Datadog, [Grafana](/guide/connections/grafana), and more — all through a standardized protocol.
  </Tab>

  <Tab title="Knowledge">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/knowledge.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=1ede63bb081599ba5a59382c4006a67b" alt="Knowledge base with vectorized storage and RAG retrieval" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/knowledge.svg" />
      </div>
    </Frame>

    Agents build and query a **vectorized [knowledge base](/guide/knowledge)** from your docs, runbooks, past incidents, and operational history. Because retrieval-augmented generation (RAG) draws on this growing knowledge base, recommendations are intended to become more accurate over time.
  </Tab>

  <Tab title="Topology">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/topology.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=1341ad94e388c0cb08c49adea62de2b1" alt="Topology graph showing multi-region resource mapping" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/topology.svg" />
      </div>
    </Frame>

    A **visual resource graph** mapping relationships across regions and providers. [Topology](/guide/infrastructure/topology) powers [Root Cause Analysis](/guide/incident/root-cause-analysis) — when something breaks, agents trace the impact path instantly.
  </Tab>

  <Tab title="Memory">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/memory.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=1e7c211529f5d309ac1c7d88dded607b" alt="Multi-layer memory system with episodic, working, semantic, and file memory" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/memory.svg" />
      </div>
    </Frame>

    **Multi-layer memory** gives agents persistent context: **Episodic** (past interactions), **Working** (current task), **Semantic** (learned concepts), and **File** (document storage). Agents remember your preferences and past decisions.
  </Tab>

  <Tab title="Guardrails">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/guardrails.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=c491f366262983e7121349167f04cd73" alt="Guardrails with PII detection, schema validation, and injection defense" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/guardrails.svg" />
      </div>
    </Frame>

    **Input and output safety gates** protect every agent interaction — PII detection, schema validation, prompt injection defense, and content filtering. Agents always operate within defined boundaries.
  </Tab>

  <Tab title="Observability">
    <Frame>
      <div style={{background: '#0a0a0a', borderRadius: '12px', padding: '24px', display: 'flex', justifyContent: 'center'}}>
        <img src="https://mintcdn.com/cloudthinker/AotcVtNAulbs9uu5/images/platform/observability.svg?fit=max&auto=format&n=AotcVtNAulbs9uu5&q=85&s=762c410f7ac73bd9e6d0f8c653be03d7" alt="Observability dashboard with OpenTelemetry traces and LLM evaluation" style={{maxWidth: '300px', width: '100%'}} width="280" height="200" data-path="images/platform/observability.svg" />
      </div>
    </Frame>

    Full **platform monitoring and evaluation** powered by OpenTelemetry. Track agent performance, trace execution, evaluate output quality with LLM-as-Judge, and monitor system health through built-in dashboards.
  </Tab>
</Tabs>
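
The four autonomy levels listed under the Autonomy tab can be read as an ordered scale, where a change runs without a human in the loop only at the top level. The Python sketch below is illustrative only: `AutonomyLevel`, `decide`, and the returned action strings are hypothetical names chosen for this example, not CloudThinker's API, and the RBAC gating mentioned above is left out.

```python
from enum import IntEnum


class AutonomyLevel(IntEnum):
    """Hypothetical encoding of the four levels named on this page."""
    NOTIFY = 1      # L1: report only
    SUGGEST = 2     # L2: recommend actions
    APPROVE = 3     # L3: act with approval
    AUTONOMOUS = 4  # L4: full self-directed execution


def decide(level: AutonomyLevel, approved: bool = False) -> str:
    """Return what an agent may do with a proposed change (illustrative)."""
    if level == AutonomyLevel.NOTIFY:
        return "report"
    if level == AutonomyLevel.SUGGEST:
        return "recommend"
    if level == AutonomyLevel.APPROVE:
        # At L3 the change waits until a human approves it.
        return "execute" if approved else "await_approval"
    # L4: no approval step is required.
    return "execute"
```

In this sketch, `decide(AutonomyLevel.APPROVE)` waits for approval, while `decide(AutonomyLevel.APPROVE, approved=True)` and `decide(AutonomyLevel.AUTONOMOUS)` both proceed to execution.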

***

## The Agent Team

Five specialized AI agents — always on, scalable, secure, intelligent.

| Agent                              | Role                     | Primary Focus                                               |
| ---------------------------------- | ------------------------ | ----------------------------------------------------------- |
| **[Alex](/guide/agents/alex)**     | Cloud Engineer           | Cost optimization, infrastructure, multi-cloud architecture |
| **[Oliver](/guide/agents/oliver)** | Security Engineer        | Compliance, vulnerabilities, threat detection, IAM          |
| **[Tony](/guide/agents/tony)**     | Database Administrator   | Query optimization, performance tuning, data analytics      |
| **[Kai](/guide/agents/kai)**       | Kubernetes Administrator | Container orchestration, cluster optimization, workloads    |
| **[Anna](/guide/agents/anna)**     | Technology Leader        | Multi-agent coordination, strategy, executive reporting     |

```bash theme={null}
@alex analyze EC2 instances with <20% CPU utilization over 30 days
@oliver audit security groups for public access on database ports
@tony #dashboard database performance metrics for production cluster
@kai optimize pod resource allocation across all namespaces
@anna coordinate quarterly infrastructure review with all agents
```
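
Each prompt above addresses an agent with an `@agent` mention and may include a `#tool` command. As a rough illustration of that structure, the Python sketch below splits a prompt into its mentions, tool commands, and remaining request text. It is not CloudThinker's actual parser: `parse_prompt` and the regular expressions are assumptions made for this example.

```python
import re

# Assumed token shapes: "@name" for agents, "#name" for tools.
MENTION = re.compile(r"@(\w+)")
TOOL = re.compile(r"#(\w+)")


def parse_prompt(prompt: str) -> dict:
    """Split a prompt into agent mentions, tool commands, and free text."""
    agents = MENTION.findall(prompt)
    tools = TOOL.findall(prompt)
    # Remove mentions and tool commands, then collapse leftover whitespace.
    text = TOOL.sub("", MENTION.sub("", prompt))
    return {"agents": agents, "tools": tools, "request": " ".join(text.split())}


parsed = parse_prompt(
    "@tony #dashboard database performance metrics for production cluster"
)
print(parsed)
```

A prompt that mentions several agents, such as the `@anna` coordination example further down this page, would yield every mentioned name in `agents`, in the order they appear.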

***

## How It Works

<Steps>
  <Step title="Connect">
    Link your cloud accounts, Kubernetes clusters, and tools. Agents discover your infrastructure automatically.
  </Step>

  <Step title="Prompt">
    Use natural language with `@agent` mentions and `#tool` commands to request analysis or actions.
  </Step>

  <Step title="Execute">
    Agents analyze, recommend, and, with your approval, implement optimizations. Alternatively, set them to full autonomy and let the system self-heal.
  </Step>
</Steps>

***

## What You Can Do

<Tabs>
  <Tab title="Cost Optimization">
    ```bash theme={null}
    @alex analyze spending trends over last quarter
    @alex #recommend reserved instance purchases for stable workloads
    @alex identify unattached volumes and unused elastic IPs
    ```

    **Typical outcome**: 30–50% cost reduction with automated implementation plans
  </Tab>

  <Tab title="Security & Compliance">
    ```bash theme={null}
    @oliver perform SOC 2 Type II compliance assessment
    @oliver audit IAM policies for privilege escalation risks
    @oliver #report security posture with remediation timeline
    ```

    **Typical outcome**: 90%+ compliance score with complete audit trail
  </Tab>

  <Tab title="Database Performance">
    ```bash theme={null}
    @tony analyze slow queries on production PostgreSQL
    @tony #dashboard query performance with P95 latency trends
    @tony recommend index optimizations for high-frequency queries
    ```

    **Typical outcome**: 70% query performance improvement
  </Tab>

  <Tab title="Kubernetes Operations">
    ```bash theme={null}
    @kai analyze pod resource utilization across all clusters
    @kai identify nodes with <30% CPU utilization for consolidation
    @kai #recommend HPA policies for variable workloads
    ```

    **Typical outcome**: 25% cost reduction with improved reliability
  </Tab>

  <Tab title="Complex Operations">
    ```bash theme={null}
    @anna coordinate AWS to Azure migration with all agents
    @anna #report quarterly infrastructure review for executive team
    @anna manage security remediation project across @oliver @alex @kai
    ```

    **Typical outcome**: 75% faster project completion with unified visibility
  </Tab>
</Tabs>

***

## Next Steps

<CardGroup cols={2}>
  <Card title="Quickstart Guide" icon="rocket" href="/quickstart">
    Get running in 5 minutes with step-by-step setup
  </Card>

  <Card title="Connections" icon="plug" href="/guide/connections/overview">
    Set up integrations with your cloud providers and tools
  </Card>

  <Card title="CloudThinker Language" icon="terminal" href="/guide/language">
    Learn the complete syntax for agent and tool commands
  </Card>

  <Card title="Use Cases" icon="lightbulb" href="/guide/use-cases/actionable-dashboards">
    See real-world examples of CloudThinker in action
  </Card>
</CardGroup>
