Gemini · Multimodal

Gemini multimodal plan

Plan combining text + image/video inputs.

All topics / Gemini prompts

Replace highlighted brackets, then copy.

Given text + image/video: summarize content, answer [question], and output next-step suggestions in bullets. Keep to [words] words.
  • question - replace the bracketed field with your info
  • words - replace the bracketed field with your info
  • Question
  • Words limit
  • Focus (objects/actions)
  • Tone
  • Summary
  • Answer
  • Next steps
  • State confidence if visuals are unclear.
  • Separate description from inference.
  • Cite any text detected in the media.