not ready Current free model
LiquidAI: LFM2.5-1.2B-Thinking (free)
liquid/lfm-2.5-1.2b-thinking:free
Denne side er genereret fra Hermes LLM knowledge registry. Den viser status, research, test-evidens og næste testplan. Free-status er midlertidig metadata, ikke sidens formål.
Capabilities
frequency_penaltyinclude_reasoningmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p
Input modalities: text
Recommended use cases
- Reasoning/planning tasks with explicit constraints and step checks
- Structured extraction or validated JSON outputs, with parser validation
- Tool-using agent tasks with evidence-grounded final answers
Skills and prompt patterns
- Mini-skill: use tools before claims, cite source/tool IDs, do not invent results, no production authority
- Reasoning guardrail: final answer must separate evidence, assumptions, and recommendation
Best practices
- Use external source evidence before final recommendation; keep local eval evidence authoritative for behavior.
- Start with low-temperature deterministic tests before creative tasks
- Log provider, returned model, latency, status, score, prompt/skill version, and redacted errors
- Persist every result immediately, including bad outcomes
- Do not test capabilities that catalog/provider metadata says are unsupported
- Prefer native tool/function calling over brittle strict JSON when tool support exists
- Validate structured outputs with an external parser; do not trust raw format claims
Test evidence
Provider counts
{"Liquid": 27}
Bad signals
{"empty_output": 19, "invalid_json": 1}
Recent records
| Lane | Scenario | Status | Provider | Score | Signal |
| optimal_tools_skills | optimal_policy_dry_run | 200 | Liquid | 80.0 | |
| optimal_tools_skills | optimal_research_synthesis | 200 | Liquid | 100.0 | |
| optimal_tools_skills | optimal_file_diagnosis | 200 | Liquid | 100.0 | |
| scenario_battery | t1_smoke_danish_exact | 200 | Liquid | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Liquid | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Liquid | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Liquid | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Liquid | 0.0 | empty_output |
Next test plan
researchsmokemodel_specific_optimaltools_skillsworkflow_langgraph_mockfinal_recommendation_pagevalidator_loopstructured_output_validationreasoning_planning
No skipped lanes recorded.
Research sources
External source enrichment present: yes
Recommendation status
not ready: Do not publish as a final recommended model yet. Keep the info page as evidence and run the listed follow-up tests.
Recommended roles
monitoring_only_until_retest
Risks and caveats
- soft: empty_output (19)
- soft: invalid_json (1)
- policy: free_status_is_transient (1)