OpenAI
OpenAI and Amazon launch Stateful Runtime Environment for production-grade agents in Bedrock//OpenAI and Amazon have jointly built a Stateful Runtime Environment that runs natively in Amazon Bedrock, enabling developers to deploy multi-step AI agents with persistent state, context, and governance controls. The runtime eliminates the need for manual orchestration by handling state management, tool invocations, error handling, and long-running task resumption automatically.
vLLM v0.16.0 brings async scheduling with pipeline parallelism, 30.8% throughput gains//vLLM's latest release introduces async scheduling combined with pipeline parallelism, delivering significant performance improvements and new WebSocket-based realtime audio streaming capabilities. The update adds support for 12+ new model architectures and major enhancements to speculative decoding, RLHF workflows, and Intel XPU platforms.
GitHub
GitHub's Enterprise AI Controls reaches general availability with agent governance tools//GitHub has released Enterprise AI Controls and the agent control plane as generally available features, giving enterprise administrators deeper oversight and auditability of AI agent usage across their organizations. The release includes new capabilities for discovering agent activity, configuring enterprise agent policies via API, and managing custom agent standards through fine-grained permissions.
Devin 2.2 Launch: 3x Faster Startup, Desktop Testing, v3 API Exits Beta//Devin 2.2 ships with significant performance improvements, new desktop testing capabilities, and the official launch of the v3 API. The update includes a redesigned UI, faster integrations with Slack and Linear, and support for end-to-end testing of Linux desktop applications.
GitHub
GitHub Enterprise Server 3.20 RC adds immutable releases, enhanced secret scanning, and backup service//GitHub Enterprise Server 3.20 release candidate introduces several major features including immutable releases for supply chain protection, enhanced secret scanning with validity checks and enterprise-level bypass controls, and a new backup service that replaces the need for separate backup utilities. The release also adds enterprise team management capabilities and new security roles for simplified governance.
Vercel
Vercel opens Chat SDK in public beta, unifies bot development across Slack, Teams, Discord, and three other platforms//Vercel has open-sourced the Chat SDK, a TypeScript library that lets developers write chatbot logic once and deploy it across Slack, Microsoft Teams, Google Chat, Discord, GitHub, and Linear. The SDK features an event-driven architecture with type-safe handlers, JSX-based UI components that render natively on each platform, and pluggable state management adapters.
Cloudflare
Cloudflare One becomes first SASE platform with post-quantum encryption across all components//Cloudflare One now offers post-quantum hybrid ML-KEM encryption across its entire Secure Access Service Edge platform, including Secure Web Gateway, Zero Trust, and Wide Area Network services. The expansion covers Cloudflare IPsec (in closed beta) and Cloudflare One Appliance (generally available), enabling organizations to secure their enterprise network traffic against future quantum threats ahead of NIST's 2030 cryptographic transition deadline.
Anthropic
Anthropic launches Claude Code Security in limited preview; found 500+ zero-day vulnerabilities in open-source code//Claude Code Security, a new AI-powered capability within Claude Code, automatically scans codebases for complex vulnerabilities and suggests patches—finding security issues that traditional static analysis tools miss. Available in limited research preview for Enterprise and Team customers, the tool leverages recent improvements in Claude's cybersecurity abilities demonstrated through over 500 zero-day discoveries in production open-source repositories.
Google
Google releases Gemini 3.1 Pro with 77.1% ARC-AGI-2 score, doubling reasoning performance//Google has released Gemini 3.1 Pro, an upgraded AI model with significantly improved reasoning capabilities, now available across consumer and developer platforms. The model achieves a verified 77.1% score on the ARC-AGI-2 benchmark, more than double the performance of its predecessor, and is designed for complex problem-solving tasks requiring advanced reasoning.
NVIDIA and SGLang optimize DeepSeek R1 for GB300 NVL72, achieving 226 tokens per second per GPU in 128K-token inference//NVIDIA and the SGLang team have published optimizations for running DeepSeek R1 on the GB300 NVL72 system, leveraging prefill-decode disaggregation, pipeline parallelism, and expert parallelism to achieve 226 tokens per second per GPU on long-context workloads. The optimized setup shows a 1.53x throughput advantage over GB200 under identical conditions, with further gains possible through multi-token prediction.
Cursor 2.5 launches plugin marketplace and async subagents for multi-file workflows//Cursor's latest release introduces a plugin marketplace with pre-built integrations from partners like AWS, Figma, and Stripe, plus asynchronous subagents that allow the parent agent to continue working while background tasks execute. The update also adds fine-grained sandbox network access controls for enterprise security policies.
ElevenLabs expands ElevenAgents with versioning, RAG tools, and content guardrails//ElevenLabs released significant updates to its ElevenAgents API, introducing agent versioning, a new documentation search tool for RAG, MCP tool support, and configurable content moderation guardrails. The update also includes new endpoints for tracking conversation users and expanded SDK support across Python, JavaScript, and widget packages.
Railway releases AI agent for canvas, Postgres metrics, and network flow visualization//Railway is shipping three major features: a conversational AI agent for infrastructure management, dedicated Postgres database metrics with query statistics, and network flow visualization showing real-time traffic between services. The AI agent debuts in Priority Boarding, while Postgres metrics and network flows graduate to general availability.
Cloudflare
Cloudflare Python SDK v5.0.0-beta.1 introduces major breaking changes and 40+ new API resources//Cloudflare released the first beta version of Python SDK v5.0.0, featuring significant breaking changes driven by OpenAPI schema improvements and code generation updates. The release adds over 40 new API resources including AI-powered features, brand protection tools, D1 database management, and Real-time Kit integrations, alongside general fixes for type inference, request handling, and response parsing.
GitHub
GitHub launches Agentic Workflows in technical preview, enabling AI-driven repository automation in Markdown//GitHub Agentic Workflows let developers automate repository tasks using AI agents within GitHub Actions by writing workflows in plain Markdown instead of YAML. The feature, available via the `gh aw` CLI extension, supports natural language automation for issue triage, PR reviews, CI failure analysis, and repository maintenance with security-first defaults including read-only permissions and sandboxed execution.
AI2 launches MolmoSpaces, an open simulation platform for embodied AI with 230,000 scenes and 42 million grasps//MolmoSpaces is a large-scale, open-source ecosystem for training and evaluating embodied AI systems, unifying over 230,000 indoor scenes, 130,000+ object models, and 42 million annotated robotic grasps. The platform features physics-grounded simulation, a systematic benchmark for measuring generalization across multiple axes, and compatibility with major simulators like MuJoCo, ManiSkill, and NVIDIA Isaac.
Supabase
Supabase acquires Hydra team to build open data warehouse for Postgres//Supabase is welcoming Joe Sciarrino, co-creator of Hydra, to lead the development of Supabase Warehouse and an Open Warehouse Architecture initiative. The team will leverage pg_duckdb, an open-source Postgres extension that accelerates analytics queries by 600x, to enable serverless analytics workflows on Postgres with object storage integration.
OpenAI
OpenAI updates GPT-5.2 Instant, launches GPT-5.3-Codex with 25% faster performance//OpenAI has released multiple model updates including an improved GPT-5.2 Instant with more measured responses and clearer output on advice-seeking questions. The company also introduced GPT-5.3-Codex, a unified coding model combining code generation with general-purpose reasoning, delivering 25% faster performance and new benchmark highs.
Unsloth ships MoE training kernels with 12x speedup and 35% lower VRAM usage//Unsloth introduced custom Triton kernels and optimizations for training Mixture of Experts (MoE) language models, delivering 12x faster training with over 35% lower VRAM consumption and support for 6x longer context windows. The update supports popular MoE models including Qwen3, DeepSeek R1/V3, and GPT-OSS, and works across data-center and consumer GPUs.
OpenAI
OpenAI deploys ChatGPT to Pentagon's GenAI.mil platform for 3 million personnel//OpenAI is bringing a custom version of ChatGPT to GenAI.mil, the Department of Defense's secure enterprise AI platform used by military and civilian personnel. The deployment includes built-in safety controls and data isolation safeguards to protect sensitive government information while enabling service members to access AI capabilities for operational and administrative tasks.
Transformers.js v4 Preview Debuts on npm with New WebGPU Runtime and 10x Build Speed Gains//The Transformers.js v4 preview is now available on npm under the `@next` tag, bringing a complete rewrite with a new WebGPU runtime and major performance improvements. The release includes support for ~200 model architectures, cross-runtime compatibility (browsers, Node, Bun, Deno), and architectural optimizations that deliver 4x speedups for embedding models and 10x faster builds.
OpenAI
OpenAI Launches Trusted Access for Cyber, Commits $10M in API Credits for Defensive Security Work//OpenAI is introducing Trusted Access for Cyber, an identity-based framework that grants priority access to GPT-5.3-Codex and other frontier models for cybersecurity professionals and researchers. The initiative includes $10 million in API credits through the scaled Cybersecurity Grant Program to accelerate vulnerability discovery and remediation in open source and critical infrastructure.
OpenAI
OpenAI Launches Frontier Platform for Enterprise AI Agent Deployment and Management//OpenAI has introduced Frontier, a comprehensive platform designed to help enterprises build, deploy, and manage AI agents at scale. The platform provides AI coworkers with shared business context, integrated execution environments, and governance capabilities—enabling organizations to move beyond isolated agent pilots to production-grade AI systems that work across multiple applications and data sources.
Supabase
Supabase adds PrivateLink, Ethereum integration, and Claude connector//Supabase released multiple major features in February 2026, including PrivateLink for private AWS connectivity, direct Ethereum blockchain querying via SQL, and an official Claude integration. The update also includes a breaking change: pg_graphql is now disabled by default on new projects to improve security posture.
OpenAI
OpenAI releases GPT-5.3-Codex with agentic coding capabilities; achieves new SWE-Bench Pro high and 25% faster performance//GPT-5.3-Codex is OpenAI's most capable coding model to date, combining frontier coding performance with advanced reasoning and agentic capabilities. The model sets new benchmarks on SWE-Bench Pro and Terminal-Bench 2.0, while operating 25% faster than its predecessor and enabling developers to delegate complex, long-running tasks without losing context during iteration.
ElevenLabs ships Eleven v3 GA, WAV support, and Agents Platform enhancements//ElevenLabs has moved Eleven v3 out of alpha with improved stability, better accuracy, and lower latency. The update includes WAV output format support for Text-to-Dialogue, expanded Agents Platform capabilities with branch renaming and guardrails, and multiple SDK updates across Python, JavaScript, React, and widget packages.
OpenAI
OpenAI launches Codex app for macOS with multi-agent orchestration and skill framework//OpenAI has released the Codex app for macOS, a dedicated interface for managing and running multiple coding agents in parallel on long-running tasks. The release includes expanded access to Codex through ChatGPT Free and Go plans, doubled rate limits across paid tiers, and a new skills framework that extends Codex beyond code generation to handle complex workflows and integrations.
Harvey scales legal knowledge coverage to 60+ jurisdictions with autonomous agent pipeline//Harvey has built "The Data Factory," an automated system using AI agents to discover, validate, and integrate legal data sources at scale. Since August 2025, the pipeline has expanded knowledge source coverage from 6 to 60+ jurisdictions and integrated over 400 legal data sources, enabling agents to handle complex queries across global legal databases without manual setup.