🦾 [Browser Automation] Possibilities and Limitations: Browser Control Boundaries from a CDP Perspective

If we treat everything a human can do on a computer as the universal set, then the operational scope of the computer, the browser, CDP (Chrome DevTools Protocol), and Puppeteer forms a chain of nested subsets:
- Computer: The universal set of all operations.
- Browser: App-level permissions. The browser restricts many capabilities for security reasons, such as direct access to local files.
- CDP: Focused on debugging capabilities. Browser information unrelated to debugging (such as bookmarked web pages) cannot be accessed.
- Puppeteer: Built on top of CDP, but it does not use every CDP API, so its capabilities are a subset of CDP's.

In Browser-Use scenarios, CDP has clear capability boundaries, unlike VNC, which is a more general screen-casting solution. Understanding CDP's strengths and limitations therefore matters for the overall architectural design and for how the system evolves.
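
To make the "Puppeteer is a subset of CDP" point more concrete, here is a minimal sketch of what a raw CDP command looks like on the wire: a JSON object with an `id`, a `Domain.command` method name, and `params`. The helper name `createCommandBuilder` is hypothetical and only for illustration; `Page.navigate` and `Browser.getVersion` are real CDP methods. Nothing here talks to a browser, it only builds the messages.

```typescript
// Shape of a CDP command as sent over the DevTools WebSocket.
interface CdpCommand {
  id: number;                       // client-chosen id, echoed back in the response
  method: string;                   // "Domain.command", e.g. "Page.navigate"
  params: Record<string, unknown>;  // method-specific arguments
}

// Hypothetical helper (not part of Puppeteer or CDP): returns a builder
// that assigns increasing ids, starting at 1, to each command it creates.
function createCommandBuilder(): (
  method: string,
  params?: Record<string, unknown>
) => CdpCommand {
  let nextId = 0;
  return (method, params = {}) => {
    // CDP method names always have the form "Domain.command".
    if (!/^\w+\.\w+$/.test(method)) {
      throw new Error(`Not a Domain.command method name: ${method}`);
    }
    nextId += 1;
    return { id: nextId, method, params };
  };
}

const build = createCommandBuilder();

// Any CDP method can be expressed this way, whether or not
// a higher-level library happens to wrap it.
const version = build("Browser.getVersion");
const navigate = build("Page.navigate", { url: "https://example.com" });

console.log(JSON.stringify(version));
console.log(JSON.stringify(navigate));
```

In Puppeteer, the same method and params pair can typically be passed to `CDPSession.send`, which is how a caller reaches CDP methods that Puppeteer's high-level API does not expose. Capabilities outside CDP itself remain out of reach either way.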

