It uses a mix of scripting and screenshots to view/control browser... if u don't want it to take screenshots, you can instruct it to control browser with scripting only.
its 180 bc avoids having to use yet another MCP server (with MCP token chat window bloat) like Playwright or DevTools