Research (Deep Research)
What · Autonomous, multi-step web research. The AI scans dozens of sources, summarises them and cites what it used.
What for · Market analyses, competitor comparisons, background briefings before calls.
Start deep research · 12 sources checked
Available in: ChatGPT, Claude, Gemini, Perplexity, Grok
Vision (image understanding)
What · You upload an image, screenshot, PDF or whiteboard photo. The AI reads, describes and analyses.
What for · Transcribing whiteboard photos, reading values off charts, reviewing UIs, extracting data from receipts.
Drag and drop · image thumbnail
Available in: ChatGPT, Claude, Gemini
Computer-Use
What · The AI sees your screen, clicks, types and scrolls inside your apps. Like a virtual co-worker.
What for · Email triage, tidying folders, recurring workflows in local apps.
Desktop capture · AI working
Available in: Claude (Computer-Use + Dispatch), ChatGPT (Agent), Gemini
Best to set up on a separate machine without sensitive data.
Voice
What · You speak, the AI listens. Either as natural conversation or as dictation into any text field (Wispr Flow).
What for · Dictating emails, brainstorming on a walk, prompt input without a keyboard.
Mic pulse · waveform
Available in: ChatGPT Advanced Voice, Gemini Live, Grok, Wispr Flow for any text field
Projects
What · Persistent workspace with custom instructions, uploaded files and shared chat history. Claude and ChatGPT both call it Projects; Gemini calls it Gems.
What for · Recurring workflows (quarterly reports, contract review, competitor analysis) where the context stays the same.
Folder · about-me.md · schreibstil.md · nicht-tun.md
Available in: Claude Projects, ChatGPT Projects (up to 40 files), Gemini Gems
Agent mode
What · The AI runs multi-step tasks on its own. Browser, mail, calendar, spreadsheets. You give the direction, it does the work.
What for · "Plan a trip and book the flights", "analyse three competitors and build a deck", morning email triage.
Task list · checkmarks turn green
Available in: ChatGPT Agent (40-400 tasks/month depending on plan), Claude Dispatch, Gemini
Connect your phone (optional)
What · Pair Cowork or ChatGPT with your phone and take the AI with you. With Claude this is Dispatch: you hand the model a task while on the road and it runs on your desktop.
What for · Quick sparring sessions on the way to a meeting, dictating briefings, kicking off tasks without opening the laptop.
iOS/Android app · microphone · sync to desktop
Available in: Claude Mobile + Dispatch, ChatGPT Mobile, Gemini Mobile
Which modes you actually need depends on the stack you work with, which is exactly what the next section covers.