# AI & Agent
All posts in the AI & Agent category.
Letting an Agent Click Into Cross-Origin Iframes: How chrome-use Took On This Hard Problem
When connecting an AI agent to a browser, the hardest part isn’t opening pages. It’s the forms hidden inside cross-origin iframes: the agent can read them and fill them in, but it just can’t click “Save.” Here’s the story of how we tackled that hard problem.
iphone-use — Computer-use, but for the iPhone
An open-source agent control layer for a real iPhone: live browser remote + a see/act HTTP API for AI agents. Three input layers (Shortcut verbs / element tree / pixel fallback), clean CJK input, hardware-validated.
Is the `cache 4m23s` Line in the Status Bar Actually Accurate?
There’s a prompt-cache countdown line in the status bar I wrote for Claude Code. Where it anchors, how the formula works, and when it lies to you — let’s pull apart the source code alongside a real transcript.
Dissecting the Implementation of the /goal Command
Based on the Claude Code 2.1.139 headless source code, this breaks down how the /goal command drives the Agent to keep working until a condition is met through a session-level Stop hook.
claude agents and /bg: An Analysis of the Agent View Implementation
Based on the Claude Code 2.1.139 source code, this article breaks down how claude agents, --bg, and /bg share the background job model, daemon, and FleetView TUI.
If You Want to Automate Your Own WeChat Account, First Think Through These Three Layers
If the goal is “automation for your own account,” what really needs to be separated first is not a feature checklist, but the listening layer, execution layer, and storage layer.
From Assisted Coding to Integrated R&D: Putting AI Programming into Team Practice (PPT)
A Slidev-style Chinese presentation translated into English, explaining Agentic Coding, Skills and MCP, AGENTS.md and CLAUDE.md, sandboxing and permission controls, and how a team can truly operationalize AI programming with Plans, a YApi Skill, and docs-sync.
How QClaw Works: How It Turns OpenClaw into a Desktop Application
This article goes beyond the broad “control plane / execution plane” framing and instead walks through the current implementation piece by piece—covering the bridging layer, IPC, configuration fields, the WeChat flow, rollback mechanisms, and the evidence index—to explain how QClaw organizes OpenClaw into a deliverable desktop runtime.
OpenClaw v2026.3.8 Release Analysis: Backup, Remote Gateway, Talk Mode, and Multi-Endpoint Routing Continue to Improve
Based on the official release notes, this article summarizes the key changes in OpenClaw v2026.3.8: backup commands arriving in the main workflow, macOS remote gateway onboarding, Talk silence timeout, Brave search integration, ACP receipts, and multi-platform routing fixes.
