ComfyUI-Lora-Manager/py/services/agent/skills/enrich_hf_metadata/SKILL.md at a8adcaf0238738d54875f7caf460d7be4467bd48

mirror of https://github.com/willmiao/ComfyUI-Lora-Manager.git synced 2026-07-05 17:01:16 -03:00

Will Miao a8adcaf023 feat(agent): improve enrich_hf_metadata skill with priority_tags, preview_url fix, civitai.trainedWords

- Add identify_model_type() helper to determine lora/checkpoint/embedding
- Pass priority_tags from user settings to LLM prompt for tag relevance
- SKILL.md: instruct LLM to exclude technical/generic HF tags, cross-reference
  against priority_tags; forbid ['None'] placeholder for trigger words
- post_processor: fix preview_url not updated after download (now writes local
  .webp path to metadata); write trigger words to civitai.trainedWords instead
  of top-level; sanitize ['None']/'null'/'n/a' placeholder values to []
- download_preview() now returns str | None (local path) instead of bool
- Update tests for new return type and nested civitai.trainedWords structure

2026-07-02 22:14:44 +08:00

4.7 KiB

| name | title | description | llm_required |
| --- | --- | --- | --- |
| enrich_hf_metadata | Enrich Metadata from HuggingFace | Parse the HuggingFace model card via LLM to extract description, trigger words, base model, tags, and preview image URL. | true |

You are an expert assistant for AI image generation models. Your task is to extract structured metadata from a HuggingFace model card (README.md).

Model Information

Repository: {{hf_url}}
Model file path: {{model_path}}
Repository ID: {{repo}}

Current Metadata (may be incomplete)

{{current_metadata}}

User Priority Tags Reference

The user has configured the following list of meaningful tag categories for this model type ({{model_type}}):

{{priority_tags}}

These are the subjects, styles, and concepts the user considers useful for categorization. Use this list as a reference when evaluating tags (see the tags section below).

Available Base Models

The following base models are currently valid in this system: {{base_models}}

HuggingFace README Content

{{readme_content}}
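
The `{{...}}` placeholders above are presumably filled in before the prompt is sent to the LLM. The following is a minimal sketch of such a substitution step, not the project's actual renderer; `render_template` is a hypothetical name.

```python
import re

# Matches {{name}} placeholders; single-brace text such as {filename} is left alone.
_PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render_template(template: str, values: dict[str, str]) -> str:
    """Substitute every {{name}} placeholder in one pass (hypothetical helper).

    A single pass means placeholder-like text inside a substituted value
    (for example inside the README content) is not expanded again.
    """

    def replace(match: re.Match) -> str:
        name = match.group(1)
        if name not in values:
            raise KeyError(f"no value supplied for placeholder {name!r}")
        return values[name]

    return _PLACEHOLDER.sub(replace, template)
```

Raising on a missing value, rather than leaving the placeholder in place, is a design choice made here so that an incomplete prompt is never sent.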

Extraction Instructions

Extract the following information from the README content above:

base_model

The base model this model was trained on. Use EXACTLY one of the names from the Available Base Models list above. Do not invent new names or use aliases.

Check the YAML frontmatter (between --- markers) for base_model: first, then look at the description text and safetensors metadata. If you cannot determine it, return an empty string.
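
The lookup order and the exact-match rule above can be sketched in code. This is an illustration under stated assumptions, not the project's implementation: `frontmatter_base_model` and `validate_base_model` are hypothetical helpers, and the parser handles only a simple `base_model: value` line, not list-valued entries.

```python
import re


def frontmatter_base_model(readme: str) -> str:
    """Return the raw base_model value from the YAML frontmatter, or ''.

    Hypothetical helper: only a scalar `base_model: value` line is handled.
    """
    match = re.match(r"^---\s*\n(.*?)\n---\s*(?:\n|$)", readme, re.DOTALL)
    if not match:
        return ""
    for line in match.group(1).splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "base_model":
            return value.strip().strip("\"'")
    return ""


def validate_base_model(candidate: str, allowed: list[str]) -> str:
    """Keep the candidate only if it exactly matches an allowed name."""
    candidate = candidate.strip()
    return candidate if candidate in allowed else ""
```

A value read from the frontmatter is often a repository ID rather than one of the system's names, so it would still need mapping to an allowed name before `validate_base_model` accepts it.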

trigger_words

The trigger words or activation prompts needed to use this LoRA. Look for:

instance_prompt: in the YAML frontmatter
Phrases like "trigger word:", "trigger:", "use this prompt:", "activation prompt:"
Example prompts at the start (usually the first word or phrase before any description)

Return as an array of strings. If none found, return an empty array []. Never return ["None"] or any placeholder value; a truly empty list means no trigger words exist.
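
Because a model may still emit a placeholder despite the rule above, a caller can defensively clean the returned list. The sketch below is illustrative only; `sanitize_trigger_words` and the exact placeholder set are assumptions, not the project's post-processor.

```python
# Assumed placeholder spellings, compared case-insensitively.
PLACEHOLDERS = {"none", "null", "n/a", "unknown"}


def sanitize_trigger_words(words: object) -> list[str]:
    """Drop placeholder, blank, and non-string entries (hypothetical helper)."""
    if not isinstance(words, list):
        return []
    cleaned = []
    for word in words:
        if not isinstance(word, str):
            continue
        word = word.strip()
        if word and word.lower() not in PLACEHOLDERS:
            cleaned.append(word)
    return cleaned
```

For example, `sanitize_trigger_words(["None"])` returns `[]`, while a real trigger phrase passes through unchanged apart from surrounding whitespace.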

description

A concise 1-2 sentence summary of what this model does. Extract from the "Model description" section or the first paragraph. Return empty string if the README is too minimal.

tags

Tags that describe the model's subject, style, or concept. Exclude technical or generic HuggingFace tags, and cross-reference candidates against the User Priority Tags Reference above. Return as an array of strings. If no suitable tags are found, return an empty array [].

preview_url

The URL of the most suitable preview image from the README. Look for image tags (e.g. ![alt](url)) and the widget: section of the YAML frontmatter (which often has output.url fields). Choose the first image that appears to be a generation example (not a logo or diagram). If the image path is relative to the repository, construct the absolute URL as https://huggingface.co/{{repo}}/resolve/main/{filename}, where {filename} is the image's path within the repository. If no suitable image is found, return an empty string.
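
The URL pattern above can be expressed as a small helper. This is a sketch under assumptions: `build_preview_url` is a hypothetical name, and passing already-absolute URLs through unchanged is a choice made here, not something the instructions state.

```python
from urllib.parse import quote


def build_preview_url(repo: str, filename: str) -> str:
    """Build https://huggingface.co/{repo}/resolve/main/{filename} (hypothetical helper)."""
    if not filename:
        return ""
    # Assumption: an image reference that is already absolute is used as-is.
    if filename.startswith(("http://", "https://")):
        return filename
    path = filename
    while path.startswith("./"):
        path = path[2:]
    path = path.lstrip("/")
    # quote() keeps "/" by default and percent-encodes spaces and similar characters.
    return f"https://huggingface.co/{repo}/resolve/main/{quote(path)}"
```

With `repo="user/my-lora"` (a made-up repository) and `filename="images/sample 1.png"`, the result is `https://huggingface.co/user/my-lora/resolve/main/images/sample%201.png`.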

confidence

Your confidence level in the extracted data:

"high" — most fields were explicitly stated in the README
"medium" — some fields were inferred from context
"low" — most fields are guesses based on limited information

Output Format

Return ONLY a JSON object with exactly these fields (no markdown fences, no extra text):

{
  "model_path": "{{model_path}}",
  "base_model": "<canonical name or empty string>",
  "trigger_words": ["<word1>", "<word2>"],
  "description": "<1-2 sentence summary>",
  "tags": ["<tag1>", "<tag2>"],
  "preview_url": "<image URL or empty string>",
  "confidence": "<high|medium|low>"
}

Important:

Only include the JSON object, no other text
If a field cannot be determined, use an empty string or empty array
Do not fabricate information not supported by the README
Never use placeholder values like "None" or "unknown" for missing data — use empty string or empty array
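
On the receiving side, the output contract above can be checked before the result is used. The following is a minimal sketch, assuming a hypothetical `parse_response` helper; the field names and confidence levels come from the Output Format section, everything else is illustrative.

```python
import json

# Field names and types taken from the Output Format section.
EXPECTED_FIELDS = {
    "model_path": str,
    "base_model": str,
    "trigger_words": list,
    "description": str,
    "tags": list,
    "preview_url": str,
    "confidence": str,
}
CONFIDENCE_LEVELS = {"high", "medium", "low"}


def parse_response(raw: str) -> dict:
    """Parse the LLM reply and raise ValueError if it breaks the contract."""
    data = json.loads(raw)  # JSONDecodeError is a ValueError subclass
    if not isinstance(data, dict):
        raise ValueError("response is not a JSON object")
    if set(data) != set(EXPECTED_FIELDS):
        raise ValueError("response does not have exactly the expected fields")
    for name, expected_type in EXPECTED_FIELDS.items():
        if not isinstance(data[name], expected_type):
            raise ValueError(f"field {name!r} has the wrong type")
    for name in ("trigger_words", "tags"):
        if not all(isinstance(item, str) for item in data[name]):
            raise ValueError(f"field {name!r} must contain only strings")
    if data["confidence"] not in CONFIDENCE_LEVELS:
        raise ValueError("confidence must be high, medium, or low")
    return data
```

A reply wrapped in markdown fences fails at `json.loads`, which matches the "no markdown fences, no extra text" requirement.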
