feat(agent): optimize base model prompt (flat list display, comprehensive mapping rules, filename inference)

- agent_service._format_base_models: output bullet list instead of JSON array for cleaner LLM parsing
- prompt.md mapping section: replace 14-row HF→CivitAI table with compact rule set covering 14 mapping paths including new entries for HiDream-ai, OnomaAIResearch/Illustrious, ideogram-ai/ideogram, Tongyi-MAI/Z-Image-Turbo, and Wan-AI/Wan2.*
- base_model extraction instruction: add guidance to infer from model filename, YAML tags, and README body text when YAML frontmatter has no explicit base_model:
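A minimal standalone sketch of the formatting change in the first bullet, assuming the base model names arrive as a plain list of strings. `format_base_models` and `build_context` are hypothetical stand-ins for the methods touched in the diff, not the project's actual code; the fallback string is copied from the diff.

```python
from typing import Dict, List, Optional

# Placeholder string copied from the diff; used when the model list is unavailable.
FALLBACK = "</not available>"


def format_base_models(models: List[str]) -> str:
    """Render one ``- Name`` bullet per model (hypothetical standalone helper)."""
    return "\n".join(f"- {m}" for m in models)


def build_context(models: Optional[List[str]]) -> Dict[str, str]:
    """Hypothetical stand-in: use the placeholder when listing failed (None)."""
    if models is None:
        return {"base_models": FALLBACK}
    return {"base_models": format_base_models(models)}


if __name__ == "__main__":
    print(build_context(["SD 1.5", "SDXL 1.0"])["base_models"])
```

Because the rendered value is already a string, the `{{base_models}}` placeholder in the prompt template receives the bullets directly rather than a serialized JSON array.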
2026-07-05 17:01:16 -03:00 · 2026-07-05 15:44:19 +08:00
parent 87db23825f
commit 26c9ade1c9
3 changed files with 37 additions and 26 deletions
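The mapping rules this commit adds to prompt.md are instructions for the LLM, not executable code. As an illustration only, a subset of them could be expressed deterministically as follows. `map_hf_base_model` is a hypothetical helper that does not exist in the repository, and it covers only the rules whose target name is spelled out in full in the diff.

```python
import re
from typing import Optional


def map_hf_base_model(repo: str) -> Optional[str]:
    """Map a HuggingFace repo path to a CivitAI base model name.

    Hypothetical illustration of a subset of the prompt rules.
    Returns None when no rule applies.
    """
    r = repo.lower()
    if "illustrious" in r:
        return "Illustrious"
    if r.startswith("stabilityai/stable-video-diffusion"):
        return "SVD"
    if r == "stabilityai/stable-diffusion-xl-base-1.0":
        return "SDXL 1.0"
    if r.startswith("black-forest-labs/flux.1"):
        if "krea" in r:
            return "Flux.1 Krea"  # name taken from the removed table
        if r.endswith("-dev"):
            return "Flux.1 D"
        if r.endswith("-schnell"):
            return "Flux.1 S"
        return None
    if r in ("krea/krea-2-raw", "krea/krea-2-turbo"):
        return "Krea 2"
    if r in ("comfy-org/z_image_turbo", "tongyi-mai/z-image-turbo"):
        return "ZImageTurbo"
    # runwayml / CompVis repos encode the version as vMAJOR-MINOR.
    m = re.match(r"(?:runwayml|compvis)/stable-diffusion-v(\d)-(\d)$", r)
    if m:
        return f"SD {m.group(1)}.{m.group(2)}"
    return None
```

A lookup like this could serve as a cross-check on the LLM's answer, but any returned name would still need validating against the live base model list.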
--- a/py/services/agent/agent_service.py
+++ b/py/services/agent/agent_service.py
@@ -338,6 +338,20 @@ class AgentService:

        return result

+    # ------------------------------------------------------------------
+    # Base model formatting (flat list; grouping degraded LLM extraction)
+    # ------------------------------------------------------------------
+
+    @staticmethod
+    def _format_base_models(models: List[str]) -> str:
+        """Format the base model list as a flat, one-per-line list.
+
+        Attempts to group by family consistently degraded LLM extraction
+        accuracy — the LLM finds individual model names harder to spot
+        in comma-separated groups than in a simple ``- Name`` list.
+        """
+        return "\n".join(f"- {m}" for m in models)
+
    async def _build_prompt_context(
        self,
        skill_name: str,
@@ -403,9 +417,11 @@ class AgentService:
            )

        try:
-            context["base_models"] = await list_base_models()
+            raw_models = await list_base_models()
+            context["base_models"] = self._format_base_models(raw_models)
        except Exception as exc:
            logger.debug("Failed to list base models: %s", exc)
+            context["base_models"] = "</not available>"

        # Determine model type and load the corresponding priority_tags
        try:
--- a/py/services/agent/skills/enrich_hf_metadata/prompt.md
+++ b/py/services/agent/skills/enrich_hf_metadata/prompt.md
@@ -34,33 +34,28 @@ These are the subjects, styles, and concepts the user considers useful for categ

 ## Available Base Models

-The following base models are currently valid in this system:
+The following base models are currently valid in this system.  Use the EXACT
+name listed — do not invent aliases or modify variant suffixes.
+
 {{base_models}}

-## HuggingFace → CivitAI Base Model Mapping
+When the YAML frontmatter contains ``base_model:`` as a HuggingFace repo path,
+map it to the canonical CivitAI name as follows:

-HuggingFace repos often declare `base_model:` in YAML frontmatter using HuggingFace
-model names.  Map them to CivitAI names using this reference:
+- ``runwayml/stable-diffusion``, ``CompVis/stable-diffusion``, or ``stabilityai/stable-diffusion`` with version number → **SD** series (v1-4 → SD 1.4, v1-5 → SD 1.5, 2-1 → SD 2.1, 3-5 → SD 3.5)
+- ``stabilityai/stable-diffusion-xl-base-1.0`` → **SDXL 1.0**
+- ``black-forest-labs/FLUX.1`` (dev, schnell, krea variant) → **Flux.1** (dev → D, schnell → S)
+- ``krea/Krea-2-Raw`` or ``krea/Krea-2-Turbo`` → **Krea 2**
+- ``Comfy-Org/z_image_turbo`` or ``Tongyi-MAI/Z-Image-Turbo`` → **ZImageTurbo**
+- ``HiDream-ai/HiDream`` with variant name → **HiDream**
+- ``OnomaAIResearch/Illustrious`` or any repo path containing ``illustrious`` → **Illustrious**
+- ``ideogram-ai/ideogram`` → **Ideogram 4.0**
+- ``Wan-AI/Wan2`` with version and task → **Wan Video** family (match: T2V or I2V + model size)
+- ``CogVideo`` → **CogVideoX**
+- ``stabilityai/stable-video-diffusion`` → **SVD**

-| HuggingFace repo / model name                        | CivitAI base model |
-|-------------------------------------------------------|--------------------|
-| `runwayml/stable-diffusion-v1-5`                      | SD 1.5             |
-| `CompVis/stable-diffusion-v1-4`                       | SD 1.5             |
-| `stabilityai/stable-diffusion-2-1` / `-2-1-base`     | SD 2.1             |
-| `stabilityai/stable-diffusion-xl-base-1.0`            | SDXL 1.0           |
-| `stabilityai/stable-diffusion-3-5-large`              | SD 3.5 Large       |
-| `black-forest-labs/FLUX.1-dev`                        | Flux.1 D           |
-| `black-forest-labs/FLUX.1-schnell`                    | Flux.1 S           |
-| `black-forest-labs/FLUX.1-krea`                       | Flux.1 Krea        |
-| `krea/Krea-2-Raw` or `krea/Krea-2-Turbo`              | Krea 2             |
-| `Comfy-Org/z_image_turbo`                             | ZImageTurbo        |
-| `alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union-2.1`  | ZImageTurbo        |
-| `CogVideo`                                             | CogVideoX          |
-| `Wan-AI/Wan2.1-T2V-14B`                               | Wan Video 2.2 T2V-A14B |
-| `stabilityai/stable-video-diffusion-img2vid`          | SVD                |
-
-The model file name itself may also hint at the base model (e.g. "flux", "sdxl",
-"sd15", "krea2" in the filename).
+Also check the model filename for clues (e.g. "flux" → Flux family, "sdxl" → SDXL,
+"sd15" → SD 1.5, "krea2" → Krea 2, "zimage" → ZImageTurbo).

 ## HuggingFace README Content

@@ -75,7 +70,7 @@ Extract the following information from the README content above:
 ### base_model
 The base model this model was trained on. Use EXACTLY one of the names from the **Available Base Models** list above. Do not invent new names or use aliases.

-Check the YAML frontmatter (between --- markers) for `base_model:` first, then look at the description text and safetensors metadata.  If the YAML frontmatter uses a HuggingFace model name (e.g. `runwayml/stable-diffusion-v1-5`), use the mapping table above to find the correct CivitAI name.  If you cannot determine it, return an empty string.
+Check the YAML frontmatter for ``base_model:`` first (use the mapping rules above for HF repo paths). If the frontmatter has no ``base_model:``, look at the **model filename** (``{{model_basename}}``), YAML ``tags:``, README title and first paragraph for clues — the base model family is often embedded in the name (e.g. ``krea2`` → Krea 2, ``zimage`` → ZImageTurbo, ``flux`` → Flux, ``sdxl`` → SDXL).

 ### trigger_words
 The trigger words or activation prompts needed to use this LoRA. Look for:
--- a/py/services/llm_service.py
+++ b/py/services/llm_service.py
@@ -69,7 +69,7 @@ async def _load_model_catalog() -> Dict[str, List[str]]:
            result[provider_id] = model_ids

    _catalog_cache = result
-    logger.info(
+    logger.debug(
        "Loaded model catalog: %d providers, %d total models",
        len(result),
        sum(len(m) for m in result.values()),