Live 1m ago · CISA and allies publish secure AI-in-OT guidance for agentic control loops 1m ago · DHS issues cross-sector AI roles framework for critical infrastructure 17h ago · Fortune 500 agent adoption tops 80% in Microsoft telemetry

CISA and allies publish secure AI-in-OT guidance for agentic control loops

CISA and six allied cyber agencies released joint guidance on securely integrating AI—including autonomous agents—into operational technology environments that drive physical processes.

May 24, 2026 · operational-technology agentic-ai industrial-control-systems

Research worth reading

Browse all →

Research · Independent

Small Language Models are the Future of Agentic AI

A position paper arguing that small language models are often a better fit than large ones for agentic systems because they are cheaper, easier to deploy, and operationally better matched to repetitive tool-using workflows.

May 17, 2026 intermediate

Research · Independent

I’m sorry, but I can’t reliably identify the most relevant papers from the past 7 days without live access to arXiv, Semantic Scholar, and recent announcements.

A live literature search is required to avoid fabricating citations or missing the newest builder-relevant agentic AI papers.

May 16, 2026 intermediate

Research · Independent

Anemoi: Agent-to-Agent Coordination via Consensus-Based Planning

An agentic framework replacing centralized coordination with direct agent-to-agent communication for scalable multi-agent task solving on GAIA.

May 14, 2026 intermediate

Tools builders are using

All 20 tools →

Arize Phoenix

Arize AI

OpenTelemetry-native LLM observability and evaluation.

observability open-source

AutoGen

Microsoft Research

Conversational multi-agent framework with strong reasoning patterns.

orchestration open-source

Browserbase

Hosted, isolated browsers for agent automation with session replay.

sandbox paid

Continue

Open-source coding-agent IDE extension for VS Code and JetBrains.

distribution open-source

CrewAI

Role-based multi-agent framework with declarative crew definitions.

orchestration freemium

E2B

Cloud sandboxes for code-running AI agents.

sandbox freemium

Haystack

deepset

Pipelines for retrieval-heavy agent workloads.

framework open-source

Helicone

Lightweight LLM observability with a proxy-first model.

observability freemium

The Agent Brief

Three things in agentic AI, every Tuesday.

What changed, what matters, what builders should do next. No hype. No paid placement.

Build

All guides →

Ship a browser agent on Browserbase
A production browser-agent stack with anti-bot resilience, session replay, and a kill switch.
3-4 hours · intermediate
Build a replay-based eval set in a weekend
How to capture, redact, and score real production sessions to evaluate agent candidates.
6-8 hours · intermediate
Cost controls for agent workloads
Token budgets, fallback tiers, and the dashboards that catch runaway runs before they hurt.
4 hours · intermediate
Add long-term memory with Letta
Wire a hierarchical memory store into an existing agent and audit what it remembers.
2 hours · intermediate
Build your first production agent with LangGraph
A 90-minute walkthrough that ships a tool-using agent with persistent state, retries, and observability.
90 minutes · intermediate

Use cases

All →

Customer support: agent-led deflection at the contact moment
How leading B2C teams are reducing tier-1 ticket volume by 35-55% with a tightly-scoped support agent.
Consumer software · Tier-1 ticket deflection
Engineering: internal-tool agents over the API graph
How platform teams replace one-off internal dashboards with a shared agent over their API graph.
Internal platforms · Tool consolidation
Sales: pre-call research and account briefs
A research agent assembles a 1-page brief 30 minutes before every external call.
B2B SaaS · Rep productivity
Security: alert triage and enrichment
An agent enriches and triages SOC alerts, halving the load on tier-1 analysts.
Cybersecurity · Analyst leverage
Legal: contract redlining assistant
A focused agent flags deviations from a playbook and proposes redlines for a human to approve.
Legal · Throughput