The Daily Claw

The stories that matter today all share a single pressure point: the moment an agent needs to hold a million tokens and still stay compliant. The release of Sonnet 4.6, Alibaba’s sparse MoE experiment, and the launch of Cencurity prove that long-context defaults now carry every other operational requirement on their backs.

Anthropic’s Sonnet 4.6 makes the 1M-token window the new base plan

Claude Sonnet 4.6 is no longer tucked behind a beta flag—the Pro/Free tiers flip automatically, and Anthropic is promoting the 1,000,000-token context window as the new default for heavy workflows. Benchmarks now crown it with a 60.4% ARC-AGI-2 score plus gains on OS World and SWE-Bench, all just two weeks after Opus 4.6. If you are still tokenizing around Sonnet 4.5 or juggling multiple Claude sessions, treat 4.6 as the minimum expectation: rerun the long-contract clearances, the “hold the entire repo” demos, and any internal tooling that assumes a 100K window so the new baseline feels like a smooth upgrade instead of a panic.

Alibaba’s Qwen3.5-397B-A17B stacks MoE resilience across vision, code, and 1M-token contexts

The Qwen3.5-397B-A17B announcement shows how Alibaba is betting on sparse MoE to scale everything a founder would otherwise add as tooling: 397B parameters, 17B active per token, 512 experts, and early-fusion multimodal training across trillions of tokens. The throughput boost (8.6×–19× prior Qwen builds) plus a 76.5 IFBench visual instruction score means you can keep entire documentation sets, product walkthroughs, and even short videos in one context window; you no longer need stitching strategies for large art/assets or code conversions. Start routing your agentic tooling through Qwen3.5-Plus when vision or video advice is unavoidable, and expect the MoE routing logic to handle resilience when a single expert saturates during a batch process.

Cencurity keeps policy + per-user keys between your agents and your data

Cencurity is a gateway, not just a firewall: every agent call now gets a per-user key, a private proxy, and instrumentation that streams policy hits, latency, and request/response objects in real time. The dashboard gives founders clear traces, webhook alerts, and dry-run rollouts for new rules, so onboarding a new agent is paperwork-free but policy-verified. If your automations are still sharing one API key across ten assistants, start splitting by operator and route everything through Cencurity so that one compromised session doesn’t redefine what “trusted workspace” means.

Quick hits

HackMyClaw is still paying $100 to anyone who can coax Fiu into leaking secrets via email, making it a perfect rehearsal for your own prompt-injection resilience drills.
PicoClaw now runs on $10 hardware with under 10MB RAM and 1-second cold starts, so your edge deployments can stay cheap and responsive without Docker overhead.
BotBrain (aka BotBot) packages one brain for ROS2 robots plus Pro telemetry, meaning you can podium a robot demo with fleet telemetry, LoRa/5G halo, and chat controls right out of the box.
Drivebase unifies S3, Google Drive, and local files with Bun, smart routing, and a GraphQL API so multi-cloud file flows stop being a bespoke engineering project.
Vinyl Cache opens 100 invitation slots for Forgejo mirrors before March 20—mirror your varnishcache-aligned repos now so you keep commit history, CI, and issues intact before the link expires.

Calm observability dashboard keeping watch over automation ops — Long-context, high-security ops still look calm because you own the kill switch.