not ready | Current free model
Poolside: Laguna XS.2 (free)
poolside/laguna-xs.2:free
This page is generated from the Hermes LLM knowledge registry. It shows status, research, test evidence, and the next test plan. Free status is temporary metadata, not the purpose of the page.
Capabilities
include_reasoning, max_tokens, reasoning, temperature, tool_choice, tools
Input modalities: text
Recommended use cases
- Reasoning/planning tasks with explicit constraints and step checks
- Tool-using agent tasks with evidence-grounded final answers
Skills and prompt patterns
- Mini-skill: use tools before claims, cite source/tool IDs, do not invent results, no production authority
- Reasoning guardrail: final answer must separate evidence, assumptions, and recommendation
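The reasoning guardrail above could be checked mechanically before an answer is accepted. The following is a minimal sketch, assuming the final answer labels its parts with lines beginning `Evidence:`, `Assumptions:`, and `Recommendation:`. That labeling convention and the function name `missing_sections` are illustrative assumptions, not something the registry defines.

```python
import re

# Assumed section labels; the registry only states that the three parts must be separated.
REQUIRED_SECTIONS = ("evidence", "assumptions", "recommendation")


def missing_sections(answer: str) -> list:
    """Return the required section labels that do not start any line of the answer."""
    found = set()
    for line in answer.splitlines():
        # Accept an optional markdown prefix such as "#", "-" or "**" before the label.
        match = re.match(r"\s*(?:[#*\-]+\s*)?(\w+)\s*:", line)
        if match:
            found.add(match.group(1).lower())
    return [name for name in REQUIRED_SECTIONS if name not in found]
```

An empty return value means all three parts are present; any other result can be logged as a guardrail failure.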
Best practices
- Consult external source evidence before making a final recommendation; treat local eval evidence as authoritative for model behavior.
- Start with low-temperature deterministic tests before creative tasks
- Log provider, returned model, latency, status, score, prompt/skill version, and redacted errors
- Persist every result immediately, including bad outcomes
- Do not test capabilities that catalog/provider metadata says are unsupported
- Prefer native tool/function calling over brittle strict JSON when tool support exists
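The logging and persistence practices above can be sketched as an append-only JSON Lines writer that stores each result as soon as it is produced, bad outcomes included. The field names mirror the list above; the file layout, the function names `redact` and `persist_result`, and the redaction pattern are assumptions made for illustration.

```python
import json
import re
import time
from pathlib import Path


def redact(text):
    """Mask values that look like bearer tokens or API keys (assumed pattern)."""
    if text is None:
        return None
    return re.sub(r"(?i)(bearer\s+|api[_-]?key[=:]\s*)\S+", r"\1[REDACTED]", text)


def persist_result(path, *, provider, returned_model, latency_s, status,
                   score, prompt_version, error=None):
    """Append one result record to a JSONL file immediately and return it."""
    record = {
        "timestamp": time.time(),
        "provider": provider,
        "returned_model": returned_model,
        "latency_s": latency_s,
        "status": status,
        "score": score,
        "prompt_version": prompt_version,
        "error": redact(error),
    }
    # Open in append mode per record so a later crash cannot lose earlier results.
    with Path(path).open("a", encoding="utf-8") as handle:
        handle.write(json.dumps(record, ensure_ascii=False) + "\n")
    return record
```

Writing one line per call keeps failed runs in the log next to successful ones, which is what makes counts of bad signals reproducible afterwards.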
Test evidence
Provider counts
{"Poolside": 28}
Bad signals
{"empty_output": 15}
Recent records
| Lane | Scenario | Status | Provider | Score | Signal |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
| scenario_battery | t1_smoke_danish_exact | 200 | Poolside | 0.0 | empty_output |
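Every record above pairs an HTTP 200 status with the `empty_output` signal, so a status check alone would not have caught the failure. A minimal sketch of how such a signal might be derived and tallied follows; it assumes each record is reduced to a `(status, output_text)` pair, and the function names are hypothetical rather than taken from the registry.

```python
from collections import Counter
from typing import Iterable, Optional, Tuple


def detect_signal(status: int, output_text: Optional[str]) -> Optional[str]:
    """Flag a successful HTTP response whose body is missing or only whitespace."""
    if status == 200 and not (output_text or "").strip():
        return "empty_output"
    return None


def tally_signals(records: Iterable[Tuple[int, Optional[str]]]) -> dict:
    """Count bad signals across (status, output_text) pairs."""
    signals = (detect_signal(status, text) for status, text in records)
    return dict(Counter(signal for signal in signals if signal))
```

The resulting dictionary has the same shape as the "Bad signals" entry shown earlier on this page.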
Next test plan
research, smoke, model_specific_optimal, tools_skills, workflow_langgraph_mock, final_recommendation_page, validator_loop, reasoning_planning
Skipped lanes:
- strict_structured_output: catalog_supported_parameters_missing_structured_outputs
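The skip reason suggests that lanes are filtered against the parameters the catalog lists as supported. A minimal sketch of that gating logic is shown here. The supported set is read from the Capabilities section of this page, while the lane-to-parameter mapping and the function name `plan_lanes` are hypothetical and chosen only to reproduce the skip shown above.

```python
# Parameters listed under Capabilities on this page.
SUPPORTED = {"include_reasoning", "max_tokens", "reasoning",
             "temperature", "tool_choice", "tools"}

# Hypothetical mapping from test lane to the parameters it needs.
LANE_REQUIREMENTS = {
    "smoke": set(),
    "tools_skills": {"tools", "tool_choice"},
    "reasoning_planning": {"reasoning"},
    "strict_structured_output": {"structured_outputs"},
}


def plan_lanes(supported, requirements):
    """Split lanes into runnable ones and skipped ones with a reason string."""
    run, skipped = [], {}
    for lane, needed in requirements.items():
        missing = sorted(needed - supported)
        if missing:
            skipped[lane] = "catalog_supported_parameters_missing_" + "_".join(missing)
        else:
            run.append(lane)
    return run, skipped
```

Gating lanes this way keeps unsupported capabilities out of the test evidence, so a skipped lane is never mistaken for a failed one.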
Research sources
External source enrichment present: yes
Recommendation status
not ready: Do not publish as a final recommended model yet. Keep the info page as evidence and run the listed follow-up tests.
Recommended roles
monitoring_only_until_retest
Risks and caveats
- soft: empty_output (15)
- policy: free_status_is_transient (1)