ComfyUI-Lora-Manager/py/services/agent/skills/enrich_hf_metadata/SKILL.md at a1fd4e150bbc237fa25596958dea0e4a7adda00b

mirror of https://github.com/willmiao/ComfyUI-Lora-Manager.git synced 2026-07-05 17:01:16 -03:00


Will Miao a1fd4e150b feat(agent): optimize enrich_hf_metadata with README cleaning, Ollama native API, and expanded fields

- Add clean_readme_for_llm() to strip noise from README before LLM injection
- Keep widget section text (valuable tag signal) and unmarked code blocks (trigger words)
- Preserve standalone image alt text instead of removing entirely
- Switch Ollama to native /api/chat with think:false to fix empty content on thinking models
- Extract Sample Gallery table images and deduplicate with widget images
- Only strip code blocks with explicit language tags (bash)
- Add notes and usage_tips fields to SKILL.md output format and post-processor
- Clean up dead code, fix regex edge cases, remove double type annotation
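
The code-block rule in the notes above (strip blocks with an explicit language tag, keep unmarked ones) can be sketched roughly as follows. This is an illustration only, not the repository's clean_readme_for_llm(); the helper name strip_tagged_code_blocks and the regex are assumptions.

```python
import re

# Build the fence marker indirectly so this example nests safely inside markdown.
FENCE = "`" * 3

# Assumption: a "tagged" block is a fence immediately followed by a language tag.
_TAGGED_BLOCK = re.compile(
    rf"^{FENCE}[A-Za-z0-9_+-]+[ \t]*\n.*?^{FENCE}[ \t]*\n?",
    re.DOTALL | re.MULTILINE,
)


def strip_tagged_code_blocks(readme: str) -> str:
    """Hypothetical helper: drop fenced blocks that carry a language tag.

    Untagged fenced blocks are left in place because they often contain
    trigger words rather than install commands.
    """
    return _TAGGED_BLOCK.sub("", readme)
```

With this sketch, a block opened with a bash tag is removed while a bare fenced block holding a trigger word survives.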

2026-07-04 08:01:50 +08:00

6.2 KiB



name	title	description	llm_required
enrich_hf_metadata	Enrich Metadata from HuggingFace	Parse the HuggingFace model card via LLM to extract description, trigger words, base model, tags, and preview image URL.	true

You are an expert assistant for AI image generation models. Your task is to extract structured metadata from a HuggingFace model card (README.md).

Model Information

Repository: {{hf_url}}
Model file path: {{model_path}}
Repository ID: {{repo}}

Current Metadata (may be incomplete)

{{current_metadata}}

User Priority Tags Reference

The user has configured the following list of meaningful tag categories for this model type ({{model_type}}):

{{priority_tags}}

These are the subjects, styles, and concepts the user considers useful for categorization. Use this list as a reference when choosing values for the tags field in the output format below.

Available Base Models

The following base models are currently valid in this system: {{base_models}}

HuggingFace README Content

{{readme_content}}

Extraction Instructions

Extract the following information from the README content above:

base_model

The base model this model was trained on. Use EXACTLY one of the names from the Available Base Models list above. Do not invent new names or use aliases.

Check the YAML frontmatter (between --- markers) for base_model: first, then look at the description text and safetensors metadata. If you cannot determine it, return an empty string.
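
A caller might enforce the exact-match rule after the model replies. The following is a minimal sketch under that assumption; normalize_base_model is a hypothetical name, and the example base model names in the usage note are illustrative, not taken from the system's real list.

```python
def normalize_base_model(value: str, allowed: list[str]) -> str:
    """Hypothetical post-check: keep the value only on an exact match.

    Anything that is not literally in the allowed list (aliases, invented
    names, different casing) collapses to an empty string, mirroring the
    "cannot determine" case described above.
    """
    candidate = value.strip()
    return candidate if candidate in allowed else ""
```

For example, with an allowed list of ["SDXL 1.0", "Flux.1 D"], the value "SDXL 1.0" is kept and the alias "sdxl" becomes an empty string.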

trigger_words

The trigger words or activation prompts needed to use this LoRA. Look for:

instance_prompt: in the YAML frontmatter
Phrases like "trigger word:", "trigger:", "use this prompt:", "activation prompt:"
Example prompts at the start (usually the first word or phrase before any description)

Return as an array of strings. If none are found, return an empty array []. Never return ["None"] or any placeholder value; an empty array means no trigger words exist.
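
Because models sometimes ignore the no-placeholder rule, a post-processor could filter the array defensively. This is a minimal sketch, assuming a hypothetical helper clean_trigger_words and an illustrative placeholder set; neither comes from the repository.

```python
# Assumption: these lowercase strings are treated as "no value" placeholders.
PLACEHOLDERS = {"", "none", "n/a", "unknown", "null"}


def clean_trigger_words(words) -> list[str]:
    """Hypothetical filter: trim, drop placeholders, and de-duplicate in order."""
    cleaned: list[str] = []
    seen: set[str] = set()
    for word in words or []:
        if not isinstance(word, str):
            continue  # ignore non-string entries instead of guessing
        text = word.strip()
        if text.lower() in PLACEHOLDERS or text in seen:
            continue
        seen.add(text)
        cleaned.append(text)
    return cleaned
```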

short_description

A concise 1-2 sentence summary of what this model does. Extract from the "Model description" section or the first paragraph. Return empty string if the README is too minimal.

recommended_width, recommended_height

The recommended image generation resolution for this model, in pixels. Look for sections like "Best Dimensions", "Recommended size", "Suggested resolution", or similar phrasing in the README. Prefer the explicitly marked "Best" or default resolution. If the table/list has multiple entries (e.g. "768 x 1024 (Best)" and "1024 x 1024 (Default)"), use the one marked "Best". Return integers. If no resolution can be determined, return 0 for both.
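
The preference order described above ("Best" first, then "Default") can be illustrated with a small parser. This is a sketch only: pick_resolution and its regex are assumptions, and falling back to the first unlabelled size is a choice made here, not something the instructions specify.

```python
import re

# Matches sizes such as "768 x 1024 (Best)" or "1024x1024"; the label is optional.
_DIMENSION = re.compile(
    r"(\d{3,5})\s*[x×]\s*(\d{3,5})(?:\s*\(([^)]*)\))?",
    re.IGNORECASE,
)


def pick_resolution(text: str) -> tuple[int, int]:
    """Hypothetical helper: return (width, height), or (0, 0) if none is found."""
    matches = [
        (int(width), int(height), (label or "").lower())
        for width, height, label in _DIMENSION.findall(text)
    ]
    # Prefer an entry labelled "best", then one labelled "default".
    for wanted in ("best", "default"):
        for width, height, label in matches:
            if wanted in label:
                return width, height
    # Assumption: with no labels at all, take the first size mentioned.
    if matches:
        return matches[0][0], matches[0][1]
    return 0, 0
```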

preview_url

The URL of the most suitable preview image from the README. Look for image tags (e.g. ![alt](url)) and the YAML frontmatter widget: section (which often has output.url fields). Choose the first image that appears to be a generation example (not a logo or diagram). Construct the absolute URL as https://huggingface.co/{{repo}}/resolve/main/{filename}. If no suitable image is found, return an empty string.
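
The URL pattern above can be applied as in the following sketch. build_preview_url is a hypothetical helper; passing already-absolute URLs through unchanged and percent-encoding the path are assumptions added for illustration.

```python
from urllib.parse import quote


def build_preview_url(repo: str, filename: str) -> str:
    """Hypothetical helper: build https://huggingface.co/<repo>/resolve/main/<file>."""
    if not repo or not filename:
        return ""
    # Assumption: an image reference that is already absolute is kept as-is.
    if filename.startswith(("http://", "https://")):
        return filename
    # Drop a leading "./" or "/" so the path joins cleanly.
    relative = filename.removeprefix("./").lstrip("/")
    # Percent-encode spaces and similar characters but keep path separators.
    return f"https://huggingface.co/{repo}/resolve/main/{quote(relative, safe='/')}"
```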

notes

A plain-text summary of the model card's key practical usage information. Combine trigger words, style modifiers, recommended parameters (steps, CFG, resolution, sampler), and any setup tips into a readable paragraph. Return empty string if the README has no useful usage info.

usage_tips

A JSON string with structured usage recommendations. Extract from the README any explicit ranges or recommended values (e.g. "Set LoRA strength: 0.85 - 1.4", "CLIP strength: 0.5"). Possible fields (include only those you can determine):

{
  "strength_min": 0.85,
  "strength_max": 1.4,
  "strength_range": "0.85-1.4",
  "strength": 0.6,
  "clip_strength": 0.5,
  "clip_skip": 2
}

Return the JSON string (e.g. '{"strength_min":0.85,"strength_max":1.4}'). Return "{}" if nothing useful is found.
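
Since usage_tips arrives as a JSON string nested inside the outer JSON object, a consumer has to decode it a second time. The sketch below shows one way to do that; parse_usage_tips is a hypothetical name, and dropping keys outside the field list above is an assumption, not documented behavior.

```python
import json

# Field names taken from the example above.
KNOWN_KEYS = {
    "strength_min",
    "strength_max",
    "strength_range",
    "strength",
    "clip_strength",
    "clip_skip",
}


def parse_usage_tips(raw) -> dict:
    """Hypothetical decoder: return a dict of known keys, or {} on any problem."""
    try:
        data = json.loads(raw) if raw else {}
    except (json.JSONDecodeError, TypeError):
        return {}
    if not isinstance(data, dict):
        return {}
    # Assumption: unknown keys are discarded rather than passed through.
    return {key: value for key, value in data.items() if key in KNOWN_KEYS}
```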

confidence

Your confidence level in the extracted data:

"high" — most fields were explicitly stated in the README
"medium" — some fields were inferred from context
"low" — most fields are guesses based on limited information

Output Format

Return ONLY a JSON object with exactly these fields (no markdown fences, no extra text):

{
  "model_path": "{{model_path}}",
  "base_model": "<canonical name or empty string>",
  "trigger_words": ["<word1>", "<word2>"],
  "short_description": "<1-2 sentence summary>",
  "tags": ["<tag1>", "<tag2>"],
  "recommended_width": 768,
  "recommended_height": 1024,
  "preview_url": "<image URL or empty string>",
  "notes": "<plain-text usage summary or empty string>",
  "usage_tips": "<JSON string like '{\"strength_min\":0.85,\"strength_max\":1.4}' or '{}'>",
  "confidence": "<high|medium|low>"
}

Important:

Only include the JSON object, no other text
If a field cannot be determined, use an empty string or empty array (0 for recommended_width and recommended_height, "{}" for usage_tips)
Do not fabricate information not supported by the README
Never use placeholder values like "None" or "unknown" for missing data — use empty string or empty array
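
On the receiving side, a reply that follows these rules can be parsed and normalized roughly as shown below. This is a sketch, not the project's post-processor: parse_reply and DEFAULTS are hypothetical, tolerating a stray markdown fence is a defensive assumption, and defaulting confidence to "low" is a choice made here.

```python
import json

FENCE = "`" * 3  # built indirectly so this example nests safely inside markdown

# Fallback values per field, following the empty-value rules stated above.
DEFAULTS = {
    "model_path": "",
    "base_model": "",
    "trigger_words": [],
    "short_description": "",
    "tags": [],
    "recommended_width": 0,
    "recommended_height": 0,
    "preview_url": "",
    "notes": "",
    "usage_tips": "{}",
    "confidence": "low",  # assumption: treat a missing confidence as "low"
}


def parse_reply(text: str) -> dict:
    """Hypothetical parser: decode the reply and fill or reset bad fields."""
    body = text.strip()
    if body.startswith(FENCE):
        # Tolerate a fenced reply even though the prompt forbids fences.
        lines = body.splitlines()[1:]
        if lines and lines[-1].strip() == FENCE:
            lines = lines[:-1]
        body = "\n".join(lines)
    data = json.loads(body)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    result = {}
    for key, default in DEFAULTS.items():
        value = data.get(key)
        # Keep values of the expected type; otherwise use a fresh copy of the default.
        result[key] = value if isinstance(value, type(default)) else type(default)(default)
    return result
```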
