* feat: add meme-generation skill * Reduce meme skill prompt cost with tighter selection rules * feat(skill): overhaul meme-generation into real image generator Move from skills/creative/ to optional-skills/creative/ (niche skill, not needed by default). Replace prompt-only meme concept brainstormer with actual meme image generation: - Python script using Pillow to overlay text on template images - 10 curated templates with hand-tuned text positioning - Dynamic access to ~100 popular imgflip templates via public API - Custom image mode (--image): use AI-generated or any image as base - Two text modes: overlay (white+outline on image) or bars (black bars) - Vision verification workflow: use vision_analyze to QA the result - Auto-scaling font with pixel-accurate word wrapping - Template search via --search - No API keys required Original skill concept by adanaleycio (PR #1771), overhauled with image generation and custom image support. --------- Co-authored-by: adanaleycio <atillababa767@gmail.com>
5.3 KiB
5.3 KiB
name, description, version, author, license, metadata
| name | description | version | author | license | metadata | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| meme-generation | Generate real meme images by picking a template and overlaying text with Pillow. Produces actual .png meme files. | 2.0.0 | adanaleycio | MIT |
|
Meme Generation
Generate actual meme images from a topic. Picks a template, writes captions, and renders a real .png file with text overlay.
When to Use
- User asks you to make or generate a meme
- User wants a meme about a specific topic, situation, or frustration
- User says "meme this" or similar
Available Templates
The script supports any of the ~100 popular imgflip templates by name or ID, plus 10 curated templates with hand-tuned text positioning.
Curated Templates (custom text placement)
| ID | Name | Fields | Best for |
|---|---|---|---|
this-is-fine |
This is Fine | top, bottom | chaos, denial |
drake |
Drake Hotline Bling | reject, approve | rejecting/preferring |
distracted-boyfriend |
Distracted Boyfriend | distraction, current, person | temptation, shifting priorities |
two-buttons |
Two Buttons | left, right, person | impossible choice |
expanding-brain |
Expanding Brain | 4 levels | escalating irony |
change-my-mind |
Change My Mind | statement | hot takes |
woman-yelling-at-cat |
Woman Yelling at Cat | woman, cat | arguments |
one-does-not-simply |
One Does Not Simply | top, bottom | deceptively hard things |
grus-plan |
Gru's Plan | step1-3, realization | plans that backfire |
batman-slapping-robin |
Batman Slapping Robin | robin, batman | shutting down bad ideas |
Dynamic Templates (from imgflip API)
Any template not in the curated list can be used by name or imgflip ID. These get smart default text positioning (top/bottom for 2-field, evenly spaced for 3+). Search with:
python "$SKILL_DIR/scripts/generate_meme.py" --search "disaster"
Procedure
Mode 1: Classic Template (default)
- Read the user's topic and identify the core dynamic (chaos, dilemma, preference, irony, etc.)
- Pick the template that best matches. Use the "Best for" column, or search with
--search. - Write short captions for each field (8-12 words max per field, shorter is better).
- Find the skill's script directory:
SKILL_DIR=$(dirname "$(find ~/.hermes/skills -path '*/meme-generation/SKILL.md' 2>/dev/null | head -1)") - Run the generator:
python "$SKILL_DIR/scripts/generate_meme.py" <template_id> /tmp/meme.png "caption 1" "caption 2" ... - Return the image with
MEDIA:/tmp/meme.png
Mode 2: Custom AI Image (when image_generate is available)
Use this when no classic template fits, or when the user wants something original.
- Write the captions first.
- Use
image_generateto create a scene that matches the meme concept. Do NOT include any text in the image prompt — text will be added by the script. Describe only the visual scene. - Find the generated image path from the image_generate result URL. Download it to a local path if needed.
- Run the script with
--imageto overlay text, choosing a mode:- Overlay (text directly on image, white with black outline):
python "$SKILL_DIR/scripts/generate_meme.py" --image /path/to/scene.png /tmp/meme.png "top text" "bottom text" - Bars (black bars above/below with white text — cleaner, always readable):
python "$SKILL_DIR/scripts/generate_meme.py" --image /path/to/scene.png --bars /tmp/meme.png "top text" "bottom text"
--barswhen the image is busy/detailed and text would be hard to read on top of it. - Overlay (text directly on image, white with black outline):
- Verify with vision (if
vision_analyzeis available): Check the result looks good:If the vision model flags issues (text hard to read, bad placement, etc.), try the other mode (switch between overlay and bars) or regenerate the scene.vision_analyze(image_url="/tmp/meme.png", question="Is the text legible and well-positioned? Does the meme work visually?") - Return the image with
MEDIA:/tmp/meme.png
Examples
"debugging production at 2 AM":
python generate_meme.py this-is-fine /tmp/meme.png "SERVERS ARE ON FIRE" "This is fine"
"choosing between sleep and one more episode":
python generate_meme.py drake /tmp/meme.png "Getting 8 hours of sleep" "One more episode at 3 AM"
"the stages of a Monday morning":
python generate_meme.py expanding-brain /tmp/meme.png "Setting an alarm" "Setting 5 alarms" "Sleeping through all alarms" "Working from bed"
Listing Templates
To see all available templates:
python generate_meme.py --list
Pitfalls
- Keep captions SHORT. Memes with long text look terrible.
- Match the number of text arguments to the template's field count.
- Pick the template that fits the joke structure, not just the topic.
- Do not generate hateful, abusive, or personally targeted content.
- The script caches template images in
scripts/.cache/after first download.
Verification
The output is correct if:
- A .png file was created at the output path
- Text is legible (white with black outline) on the template
- The joke lands — caption matches the template's intended structure
- File can be delivered via MEDIA: path