by OpenAI· Released October 2024· Cutoff October 2023
computer-use-preview is a specialized model from OpenAI designed to control computer interfaces by interpreting screenshots and performing actions like clicking, typing, and scrolling. It is part of the GPT-4o family and enables AI agents to interact with software applications directly, bridging the gap between language models and GUI automation.
Input cost
$3.00 per 1M tokens
Output cost
$12.00 per 1M tokens
Context window
128K tokens
Max output
4096 tokens
Modalities
License
proprietary
Automating GUI-based tasks and controlling computer interfaces via natural language instructions.