todai — Wednesday, 29 April

TODAY'S ITEMS

1. Claude Code v2.1.121: alwaysLoad for MCP Servers, PostToolUse Hooks Go Universal, Multi-GB Memory Leaks Fixed

Released today, v2.1.121 adds the alwaysLoad:true option to MCP server configs — every tool on that server is always available without tool-search deferral — plus PostToolUse hooks for all tools (not just MCP) with a new hookSpecificOutput.updatedToolOutput field that lets any hook rewrite any tool's output before Claude sees it. Three multi-GB memory leaks are patched: unbounded RSS growth from image-heavy sessions, a /usage leak on large transcript histories, and a progressive leak from long-running failing tools.
Source: GitHub
Why it matters: If MCP tools keep getting deferred when you need them, add alwaysLoad:true to that server's config — one line, fixes immediately.
Verified

2. Opus in Claude Code for Pro Users: Clarified — Existing Subscribers Were Never Affected

A Reddit post yesterday (383 upvotes, r/ClaudeCode) flagged that Anthropic quietly added an "additional purchase" note to Opus access in Claude Code for Pro users. Anthropic's head of growth confirmed on X that this was a small test on approximately 2% of new prosumer sign-ups only; existing Pro and Max subscribers were never part of the pricing test, and Sonnet 4.5 remains the default with Opus 4.7 on Max plans as before. Ars Technica confirmed the clarification today.
Source: Ars Technica
Why it matters: If you're on an existing Pro or Max plan, nothing has changed — but Anthropic is clearly testing how to price compute-heavy agentic workloads, so expect more tiering experimentation ahead.
Verified

3. Claude's Tokenizer Charges 2-3× More Tokens for Non-English Text Than OpenAI's

Researcher @arankomatsuzaki published comparative tokenizer analysis (141K impressions) showing Claude's tokenizer is materially more expensive for non-English content: Hindi costs 3.24× more tokens than equivalent text in OpenAI's tokenizer (vs OpenAI's 1.37×), Arabic is 2.86× (vs 1.31×), and Chinese is 1.71× (vs 1.15×), normalised to the same English token count baseline.
Source: Twitter/@arankomatsuzaki
Why it matters: If your agents process non-English content at scale, budget 2-3× more tokens per request than your English testing would suggest — the cost cliff is real and unannounced.
Emerging

NEW TOOL / PRODUCT SPOTLIGHT

luongnv89/claude-howto — A visual, example-driven guide to Claude Code: basic concepts through advanced agents, with copy-paste templates for slash commands, CLAUDE.md memory, skills, hooks (all 28 hook events, all 5 hook types), subagents, MCP, and plugins. MIT licensed, actively maintained, synced with every Claude Code release (currently v2.1.119+). Get started in 15 minutes per module:

git clone https://github.com/luongnv89/claude-howto
cp claude-howto/01-slash-commands/*.md .claude/commands/
cp claude-howto/02-memory/project-CLAUDE.md ./CLAUDE.md
cp -r claude-howto/03-skills/code-review ~/.claude/skills/

https://github.com/luongnv89/claude-howto

PROMPT OF THE DAY

# Add to your CLAUDE.md or AGENTS.md for any project with build
# or long-running commands:

For any command likely to produce large or continuous output
(docker build, docker compose up, npm install, long test suites,
compilation):

1. Run it as a subagent Task so the output does not consume
   this session's context window
2. Pipe output to a temp file:
   command > /tmp/build.log 2>&1 &
3. Monitor with:
   tail -20 /tmp/build.log | grep -E "error|warn|done"
4. Do NOT read the full log file unless errors are detected
5. If a command takes longer than expected, check whether it is
   still running before assuming it failed

Prevents context window pollution from build output — sourced from a 686-upvote r/LocalLLaMA discussion on agentic coding failures. Add this block to your project CLAUDE.md and it applies to every Claude Code session in that project. https://www.reddit.com/r/LocalLLaMA/comments/1sxqa2c/im_done_with_using_local_llms_for_coding/

LANDSCAPE NOTES

Microsoft VibeVoice trending on HN today (251 pts, 1,500+ stars) — open-weight speech-to-text with built-in speaker diarisation, MIT licensed, Python. Worth watching, but a security researcher found the companion "Vibing" app ships a closed binary that may send data to Microsoft Azure — use the base VibeVoice model only until this is resolved. https://github.com/microsoft/VibeVoice
GitHub Copilot switches to usage-based billing June 1 — agentic workflows metered separately from seat licences. Relevant if you're comparing Claude Code vs Copilot cost projections for heavier agentic use. https://x.com/github/status/2048794729274278258
Unconfirmed: Anthropic reportedly building a shared Tasks list visible across both Claude Dispatch and Claude Code, plus mobile sidebar customisation. Single-source screenshot with 16 impressions — low confidence, but directionally interesting for multi-surface agentic coordination. https://x.com/n5waha/status/2048937399544377506