not ready Current free model
Google: Gemma 4 26B A4B (free)
google/gemma-4-26b-a4b-it:free
Denne side er genereret fra Hermes LLM knowledge registry. Den viser status, research, test-evidens og næste testplan. Free-status er midlertidig metadata, ikke sidens formål.
Capabilities
frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningrepetition_penaltyresponse_formatseedstoptemperaturetool_choicetoolstop_ktop_p
Input modalities: image, text, video
Recommended use cases
- Reasoning/planning tasks with explicit constraints and step checks
- Structured extraction or validated JSON outputs, with parser validation
- Tool-using agent tasks with evidence-grounded final answers
- Vision or multimodal analysis where image input is needed
Skills and prompt patterns
- Mini-skill: use tools before claims, cite source/tool IDs, do not invent results, no production authority
- Reasoning guardrail: final answer must separate evidence, assumptions, and recommendation
Best practices
- Use external source evidence before final recommendation; keep local eval evidence authoritative for behavior.
- Start with low-temperature deterministic tests before creative tasks
- Log provider, returned model, latency, status, score, prompt/skill version, and redacted errors
- Persist every result immediately, including bad outcomes
- Do not test capabilities that catalog/provider metadata says are unsupported
- Prefer native tool/function calling over brittle strict JSON when tool support exists
- Validate structured outputs with an external parser; do not trust raw format claims
Test evidence
Status counts
{"429": 32, "200": 14}
Provider counts
{"unknown": 32, "Google AI Studio": 12, "Darkbloom": 2}
Bad signals
{"429_cooldown_or_provider_rate_limit": 32}
Recent records
| Lane | Scenario | Status | Provider | Score | Signal |
| optimal_tools_skills | optimal_policy_dry_run | 429 | | 0.0 | 429_cooldown_or_provider_rate_limit |
| optimal_tools_skills | optimal_research_synthesis | 429 | | 0.0 | 429_cooldown_or_provider_rate_limit |
| optimal_tools_skills | optimal_file_diagnosis | 429 | | 0.0 | 429_cooldown_or_provider_rate_limit |
| scenario_battery | t1_smoke_danish_exact | 429 | | 0.0 | 429_cooldown_or_provider_rate_limit |
| scenario_battery | t1_smoke_danish_exact | 429 | | 0.0 | 429_cooldown_or_provider_rate_limit |
| scenario_battery | t1_smoke_danish_exact | 200 | Google AI Studio | 100.0 | |
| scenario_battery | t1_smoke_danish_exact | 200 | Google AI Studio | 100.0 | |
| scenario_battery | t1_smoke_danish_exact | 200 | Google AI Studio | 100.0 | |
Next test plan
researchsmokemodel_specific_optimaltools_skillsworkflow_langgraph_mockvision_multimodalfinal_recommendation_pagevalidator_loopstructured_output_validationreasoning_planning
No skipped lanes recorded.
Research sources
External source enrichment present: yes
Recommendation status
not ready: Do not publish as a final recommended model yet. Keep the info page as evidence and run the listed follow-up tests.
Recommended roles
monitoring_only_until_retest
Risks and caveats
- hard: 429_cooldown_or_provider_rate_limit (32)
- policy: free_status_is_transient (1)