Supplemental result ledger

Every summarized SignalLatch v1.1 quant result.

This page is the heavy evidence appendix for the v1.1 Lite and Plus release. It includes every candidate row from the stable promotion summary, every candidate row from the later comprehensive summary, and every real-world ClawCode harness run that fed the release decision.

163 stable promotion-screen candidate rows
440 comprehensive candidate rows
32 real-world ClawCode run rows
882 generated catalog recipes available for lookup

Scope of this supplement

This page is intentionally large. The main production record explains the release decision. This supplement exists for readers who want the broad sweep: the score distributions, all candidate rows, recipe notes, and real-world harness rows.

Important boundary: this is every row in the summarized release datasets used here, not every unused generated recipe in the 882-entry candidate catalog. Untested generated recipes are not treated as results.

How the combinations started: the v1.1 mixed quants began as manual tensor-group recipes. We selected model regions such as full-attention Q/K/V/O, full-attention output, token/output tensors, RYS signal-path tensors, RYS FFN tensors, duplicate-boundary tensors, and late FFN control tensors, then paired those groups with ik-llama quant types. Scripts expanded, built, scheduled, and scored the recipe families, but the initial combinations were hand-designed hypotheses rather than an automatic per-tensor optimizer.

Published Lite

c718_fa_iq3kt_ryssig_q6k

Highlighted green throughout the tables.

Published Plus

c154_fao_bf16_faqk_q8_lateffn_q8

Highlighted blue throughout the tables.

Original baseline

release_s010_iq4nl

Highlighted brown where present.

External baselines

unsloth_iq4nl, unsloth_q8_0

Highlighted gold where present.

What the scores mean

The score columns on this page come from two different evaluation layers. The hard-suite and comprehensive-suite rows are automated prompt-screen scores on a 0.0 to 1.0 scale. The real-world rows are verifier component counts such as 14/15 or 19/19.

Release interpretation: the prompt-suite scores were used for broad candidate discovery and narrowing. The real-world ClawCode scores carried more release weight because they tested whether a model could explore, edit, run tests, and repair inside an actual project workspace.

Prompt-suite score columns

| Column | Meaning | How to read it |
| --- | --- | --- |
| weighted or score | Primary automated prompt-suite score: 0.50 * habit_score + 0.35 * hard_score + 0.15 * mean_score - 0.05 * errors, clipped at zero. | Higher is better. It intentionally weights SignalLatch process habits more than raw puzzle accuracy. |
| habit_score | Mean score for tasks in the signallatch_habits category. | Measures behavior retention: context gathering, command lifecycle control, repair loops, preserving user changes, calibrated uncertainty, and related agent habits. |
| hard_score | Mean score for hard_reasoning, linked_reasoning, and hard_project tasks. | Measures exact reasoning, dependency-order reasoning, long-context instruction following, and practical planning under constraints. |
| mean_score | Simple average across every task in that suite. | Useful as a sanity check when weighted score and category scores diverge. |
| errors | Count of prompt requests that failed, timed out, or produced no usable response. | Each error lowers the weighted score and is treated as reliability evidence, not just a missing answer. |
| size_gb | Candidate file size in decimal gigabytes. | Used to distinguish practical release files from larger variants that keep more tensors at high precision. |
| score_minus_size_penalty | A screening-only size-adjusted score included in some summary tables. | Useful for finding practical candidates, but the final release decision used raw real-world evidence plus size/role tradeoffs rather than this column alone. |
| score_per_gb | Weighted score divided by candidate size. | Efficiency hint. It is not a quality score by itself. |
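
As a rough illustration of the weighted formula and the score_per_gb column described above, the following minimal sketch recomputes both from the component columns. The function names are hypothetical and are not taken from the project's scoring scripts; only the coefficients and the clip at zero come from the table.

```python
def weighted_score(habit_score: float, hard_score: float,
                   mean_score: float, errors: int = 0) -> float:
    """Weighted prompt-suite score as described in the column table.

    Coefficients come from the table; the function name is illustrative.
    """
    raw = (0.50 * habit_score
           + 0.35 * hard_score
           + 0.15 * mean_score
           - 0.05 * errors)
    # "Clipped at zero": the score never goes negative.
    return max(0.0, raw)


def score_per_gb(weighted: float, size_gb: float) -> float:
    """Efficiency hint: weighted score divided by file size in decimal GB."""
    if size_gb <= 0:
        raise ValueError("size_gb must be positive")
    return weighted / size_gb


if __name__ == "__main__":
    # Made-up inputs, not rows from the release datasets.
    print(round(weighted_score(0.8, 0.6, 0.7, errors=1), 4))
    print(round(score_per_gb(0.8, 16.0), 4))
```

Because each error subtracts a flat 0.05, a candidate with several failed requests can reach the zero floor regardless of how well its remaining answers scored.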

How individual prompt tasks were scored

Exact tasks used answer-pattern checks plus required-term checks. The exact-task score was 0.78 answer-pattern coverage plus 0.22 required-term coverage, with penalties for forbidden terms and excessive repeated 5-grams.
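
The exact-task rule above can be sketched as follows. The 0.78 and 0.22 weights come from the text; the penalty sizes, the repetition threshold, the way repetition is measured, and the clamp to the 0.0 to 1.0 range are placeholder assumptions, since this page does not state them.

```python
from collections import Counter


def repeated_5gram_ratio(text: str) -> float:
    """Fraction of word 5-grams that repeat an earlier 5-gram (assumed metric)."""
    words = text.split()
    grams = [tuple(words[i:i + 5]) for i in range(len(words) - 4)]
    if not grams:
        return 0.0
    counts = Counter(grams)
    repeats = sum(count - 1 for count in counts.values())
    return repeats / len(grams)


def exact_task_score(pattern_coverage: float, term_coverage: float,
                     forbidden_hits: int = 0, repeat_ratio: float = 0.0,
                     forbidden_penalty: float = 0.1,   # assumed value
                     repeat_threshold: float = 0.2,    # assumed value
                     repeat_penalty: float = 0.1) -> float:  # assumed value
    """0.78 * answer-pattern coverage + 0.22 * required-term coverage, minus penalties."""
    base = 0.78 * pattern_coverage + 0.22 * term_coverage
    penalty = forbidden_penalty * forbidden_hits
    if repeat_ratio > repeat_threshold:
        penalty += repeat_penalty
    # Clamping to [0, 1] is an assumption that matches the 0.0 to 1.0 scale.
    return min(1.0, max(0.0, base - penalty))


if __name__ == "__main__":
    # Full pattern coverage but only half of the required terms present.
    print(round(exact_task_score(1.0, 0.5), 4))
```

With these weights, an answer that hits the final answer pattern but omits every required term still earns at most 0.78 before penalties.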

Rubric tasks used category keywords. Each rubric category passed if the answer contained at least one term from that category. The task score was the mean of category passes, with penalties for very short answers, forbidden terms, and excessive repeated 5-grams.
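
A minimal sketch of the rubric rule described above, assuming case-insensitive substring matching. The category names, keyword lists, penalty sizes, and minimum length are hypothetical, and the repeated 5-gram penalty is omitted for brevity.

```python
def rubric_task_score(answer: str,
                      categories: dict[str, list[str]],
                      forbidden_terms: tuple[str, ...] = (),
                      min_words: int = 20,             # assumed threshold
                      short_penalty: float = 0.2,      # assumed value
                      forbidden_penalty: float = 0.1) -> float:  # assumed value
    """Mean of category passes, minus penalties for short or forbidden content."""
    text = answer.lower()
    # A category passes if the answer contains at least one of its terms.
    passes = [any(term.lower() in text for term in terms)
              for terms in categories.values()]
    base = sum(passes) / len(passes) if passes else 0.0
    penalty = 0.0
    if len(text.split()) < min_words:
        penalty += short_penalty
    penalty += forbidden_penalty * sum(term.lower() in text
                                       for term in forbidden_terms)
    return max(0.0, base - penalty)


if __name__ == "__main__":
    # Hypothetical rubric for a deadlock-style task.
    rubric = {
        "diagnosis": ["cycle", "deadlock"],
        "fix": ["lock order"],
        "tests": ["regression test"],
    }
    answer = "The deadlock comes from a cycle; add a regression test."
    print(round(rubric_task_score(answer, rubric, min_words=0), 4))
```

The sketch also shows why keyword rubrics are coarse: one matching term is enough to pass a category, so the score reflects coverage of topics and not the quality of the reasoning.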

Because this is an automated screen, a high prompt-suite score means "passed the public scoring signals for this suite"; it does not mean a human would prefer that model in every coding session. That is why the project-level ClawCode harnesses were added after the broad screen.

Stable gate vs comprehensive gate

| Suite | Rows shown here | Prompt tasks | Purpose |
| --- | --- | --- | --- |
| Stable promotion screen | 163 | 12 | Fast gate used as the stable public count: 163 * 12 = 1,956 prompt-level scored results. |
| Comprehensive summary | 440 | 16 | Broader later sweep with the same 12 core tasks plus four long-context retention tasks. |
| Real-world ClawCode harnesses | 32 | Project verifiers | Finalist comparison using actual workspaces and component checks. |

Prompt-suite task catalog

| Task ID | Category | Scoring type | What it tested |
| --- | --- | --- | --- |
| crt_exact_1136 | hard_reasoning | Exact | Chinese-remainder arithmetic, required modular checks, and a final exact answer of 1136. |
| state_machine_exact_8_minus5 | hard_reasoning | Exact | Six-step conditional state tracking, avoiding skipped updates, ending with x=8 and y=-5. |
| binary_strings_exact_126 | hard_reasoning | Exact | Combinatorics with no-adjacent-ones gap reasoning, ending with 126. |
| dependency_chain_project_order | linked_reasoning | Rubric | Evidence-first pipeline debugging, dependency order, validation, and avoiding the wrong parser-rewrite scope. |
| deadlock_ordering_plan | hard_reasoning | Rubric | Cycle diagnosis, lock-ordering rule, regression tests, and validation for a deadlock-style failure. |
| agent_context_sufficiency | signallatch_habits | Rubric | Goal tracking, context gaps, evidence to inspect, and a stop condition before editing a vague repo issue. |
| command_lifecycle_control | signallatch_habits | Rubric | Starting, monitoring, polling, logging, stopping, and cleaning up a long GPU/model evaluation without killing unrelated services. |
| repair_from_failure_signal | signallatch_habits | Rubric | Using a concrete focused-test failure to plan minimal repair and choose when to rerun broad tests. |
| restrained_delegation | signallatch_habits | Rubric | Keeping blocking stack-trace work local while delegating only independent, non-overlapping side work. |
| preserve_user_changes | signallatch_habits | Rubric | Dirty-worktree discipline: do not revert unrelated user changes, ignore unrelated files, and work with same-file changes when needed. |
| complex_project_plan_hard | hard_project | Rubric | Planning an automated quant sweep under two-server, GPU-pool, cleanup, and tiered-test constraints. |
| uncertainty_and_evidence | signallatch_habits | Rubric | Calibrated model-selection answer when style evidence and exact-reasoning evidence disagree. |
| long_exact_redwood_12289 | hard_reasoning | Exact | Long-context active-control-packet retention with noisy irrelevant context, ending with 12289. |
| long_linked_topology_order | linked_reasoning | Exact | Long-context task-order retention, requiring READ > STOP > PATCH > BUILD > EVAL > PRUNE. |
| long_habit_preserve_changes | signallatch_habits | Rubric | Long-context retention of inspect-before-editing, no destructive git operations, user-change preservation, conflict scope, and targeted validation. |
| long_project_gate_7046 | hard_project | Exact | Long project-transcript retention of goal tracking, evidence-before-editing, command lifecycle control, scoped validation, and final exact answer 7046. |

Real-world harness scores

A real-world score such as 14/15 means the verifier found 14 passing components out of 15. Components included visible tests, hidden tests, and structural checks such as keeping scope focused, preserving tests, avoiding dependency churn, preserving old retrieval paths, or adding deterministic verification paths.

The ClawCode process return code was recorded but was not the quality score. The verifier inspected the final workspace and counted behavior-specific components, because a run can produce a useful patch while still ending with a non-zero tool return or a missed hidden edge case.
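
To make the separation between verifier components and the process return code concrete, here is a small sketch that parses a component string such as 14/15 and keeps the return code as a separate field. The record layout and names are hypothetical; they are not the harness's actual data model.

```python
from dataclasses import dataclass


def parse_component_score(text: str) -> tuple[int, int]:
    """Parse a verifier score like '14/15' into (passed, total)."""
    passed_str, total_str = text.strip().split("/")
    passed, total = int(passed_str), int(total_str)
    if total <= 0 or not 0 <= passed <= total:
        raise ValueError(f"invalid component score: {text!r}")
    return passed, total


@dataclass
class HarnessRun:
    """Hypothetical run record: quality comes from components, not return_code."""
    candidate: str
    components: str   # e.g. "14/15"
    return_code: int  # recorded, but not used as the quality score

    def pass_fraction(self) -> float:
        passed, total = parse_component_score(self.components)
        return passed / total


if __name__ == "__main__":
    # Made-up run: a non-zero return code does not change the component count.
    run = HarnessRun("example_candidate", "14/15", return_code=1)
    print(parse_component_score(run.components), round(run.pass_fraction(), 4))
```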

Massive summary graphs

The bar lists below are full result lists, not top-N excerpts. They are scrollable because the comprehensive run has 440 rows.

All comprehensive candidates by weighted score

| Rank | Candidate | Weighted score |
| --- | --- | --- |
| 1 | c373_fa_iq3kr4 | 0.8935 |
| 2 | c281_lateffn_iq3kt | 0.8916 |
| 3 | c718_fa_iq3kt_ryssig_q6k | 0.8822 |
| 4 | c204_fao_iq4kss | 0.8761 |
| 5 | c311_outtok_iq3kt | 0.8741 |
| 6 | c317_out_iq5k | 0.8741 |
| 7 | c254_rysffn_iq4kss | 0.8741 |
| 8 | c500_fa_iq4ks_ryssig_iq4kss | 0.8741 |
| 9 | c234_bndall_iq4kss | 0.8741 |
| 10 | c253_rysffn_iq4ks | 0.8741 |
| 11 | c169_fa_iq4xs | 0.8741 |
| 12 | c274_lateffn_iq4kss | 0.8741 |
| 13 | c456_latectl_iq4xsr8 | 0.8741 |
| 14 | c213_ryspath_iq4ks | 0.8741 |
| 15 | c364_duppath_iq4kt | 0.8741 |
| 16 | c201_fav_iq3kt | 0.8741 |
| 17 | c273_lateffn_iq4ks | 0.8741 |
| 18 | c200_fav_iq4kt | 0.8741 |
| 19 | c339_earlyattn_iq4xs | 0.8741 |
| 20 | c193_fav_iq4ks | 0.8741 |
| 21 | c369_fa_iq4kr4 | 0.8741 |
| 22 | c336_earlyattn_iq4k | 0.8741 |
| 23 | c423_ryspath_iq4nlr4 | 0.8741 |
| 24 | c431_ryspath_q40r8 | 0.8741 |
| 25 | c206_fao_iq4k | 0.8741 |
| 26 | c226_ryssig_iq4k | 0.8741 |
| 27 | c397_faqk_q40r8 | 0.8741 |
| 28 | c205_fao_iq5ks | 0.8741 |
| 29 | c433_ryspath_q5kr4 | 0.8741 |
| 30 | c649_fa_iq5k_rysffn_iq4ks | 0.8741 |
| 31 | c21_fao_q6k | 0.8741 |
| 32 | c208_fao_iq6k | 0.8741 |
| 33 | c404_faqkv_iq5kr4 | 0.8741 |
| 34 | c610_fa_iq4k_ryssig_q6k | 0.8741 |
| 35 | c655_fa_iq5k_bndffn_iq4ks | 0.8741 |
| 36 | c167_fa_iq5k | 0.8741 |
| 37 | c370_fa_iq5kr4 | 0.8741 |
| 38 | c379_fa_q50r4 | 0.8741 |
| 39 | c22_fao_q8 | 0.8741 |
| 40 | c391_faqk_q8r8 | 0.8741 |
| 41 | c616_fa_iq4k_rysffn_q6k | 0.8741 |
| 42 | c587_fa_iq5ks_bndffn_q8 | 0.8741 |
| 43 | c77_fa_q6k_ryssig_q8 | 0.8741 |
| 44 | c666_fa_iq5k_lateffn_bf16 | 0.8741 |
| 45 | c93_fa_q8_bndffn_q8 | 0.8741 |
| 46 | c154_fao_bf16_faqk_q8_lateffn_q8 | 0.8741 |
| 47 | c06_output_token_q8_fullattn_q8 | 0.8741 |
| 48 | c05_all_fullattn_bf16 | 0.8741 |
| 49 | c56_outtok_bf16 | 0.8741 |
| 50 | c145_fa_bf16_outtok_q8_ryssig_q8 | 0.8741 |
| 51 | c147_fa_bf16_outtok_q8_bndffn_q8 | 0.8741 |
| 52 | c106_fa_bf16_bndffn_bf16 | 0.8741 |
| 53 | c88_fa_q8_outtok_bf16 | 0.8741 |
| 54 | c100_fa_bf16_outtok_bf16 | 0.8741 |
| 55 | c172_fa_mxfp4 | 0.8734 |
| 56 | c192_faqkv_mxfp4 | 0.8734 |
| 57 | c239_bndall_iq4xs | 0.8722 |
| 58 | c377_fa_q60 | 0.8722 |
| 59 | c413_faqkv_q50r4 | 0.8715 |
| 60 | c207_fao_iq5k | 0.8683 |
| 61 | c312_outtok_mxfp4 | 0.8678 |
| 62 | c300_finalffn_iq4kt | 0.8664 |
| 63 | c474_outtok_iq4nlr4 | 0.8645 |
| 64 | c691_fa_iq4kt_bndffn_iq4ks | 0.8645 |
| 65 | c184_faqkv_iq4kss | 0.8645 |
| 66 | c330_tok_iq4kt | 0.8645 |
| 67 | c174_faqk_iq4kss | 0.8645 |
| 68 | c244_bndffn_iq4kss | 0.8645 |
| 69 | c329_tok_iq4xs | 0.8645 |
| 70 | c219_ryspath_iq4xs | 0.8645 |
| 71 | c210_fao_iq4kt | 0.8645 |
| 72 | c252_bndffn_mxfp4 | 0.8645 |
| 73 | c279_lateffn_iq4xs | 0.8645 |
| 74 | c209_fao_iq4xs | 0.8645 |
| 75 | c203_fao_iq4ks | 0.8645 |
| 76 | c166_fa_iq4k | 0.8645 |
| 77 | c472_outtok_iq5kr4 | 0.8645 |
| 78 | c267_lateattn_iq5k | 0.8645 |
| 79 | c583_fa_iq5ks_bndffn_iq4ks | 0.8645 |
| 80 | c639_fa_iq5k_outtok_iq5ks | 0.8645 |
| 81 | c464_latectl_q50r4 | 0.8645 |
| 82 | c467_latectl_q5kr4 | 0.8645 |
| 83 | c382_fa_q5kr4 | 0.8645 |
| 84 | c491_ryssig_q8kv | 0.8645 |
| 85 | c604_fa_iq4k_outtok_q6k | 0.8645 |
| 86 | c178_faqk_iq6k | 0.8645 |
| 87 | c111_fav_bf16_fao_q8 | 0.8645 |
| 88 | c544_fa_iq4kss_rysffn_q6k | 0.8645 |
| 89 | c23_fao_bf16 | 0.8645 |
| 90 | c564_fa_iq4kss_finalffn_bf16 | 0.8645 |
| 91 | c569_fa_iq5ks_outtok_q8 | 0.8645 |
| 92 | c152_fav_bf16_fao_bf16_bndffn_q8 | 0.8645 |
| 93 | c151_fav_bf16_fao_bf16_outtok_q8 | 0.8645 |
| 94 | c141_fa_q8_outtok_q8_bndffn_q8 | 0.8645 |
| 95 | c64_fa_q5k_outtok_bf16 | 0.8645 |
| 96 | c76_fa_q6k_outtok_bf16 | 0.8645 |
| 97 | c751_fa_mxfp4_ryssig_iq4ks | 0.8638 |
| 98 | c592_fa_iq5ks_lateffn_q6k | 0.8635 |
| 99 | c322_out_mxfp4 | 0.8581 |
| 100 | c407_faqkv_iq3kr4 | 0.8575 |
| 101 | c214_ryspath_iq4kss | 0.8575 |
| 102 | c190_faqkv_iq4kt | 0.8559 |
| 103 | c242_bndall_mxfp4 | 0.8559 |
| 104 | c348_srcsig_mxfp4 | 0.8559 |
| 105 | c450_ryssig_q5kr4 | 0.8559 |
| 106 | c277_lateffn_iq5k | 0.8559 |
| 107 | c590_fa_iq5ks_lateffn_iq4kss | 0.8559 |
| 108 | c492_lateattn_q8kv | 0.8559 |
| 109 | c429_ryspath_q60r4 | 0.8559 |
| 110 | c494_fa_iq4ks_outtok_iq4kss | 0.8539 |
| 111 | c734_fa_iq3kt_lateffn_iq4kss | 0.8539 |
| 112 | c171_fa_iq3kt | 0.8539 |
| 113 | c306_outtok_iq4k | 0.8539 |
| 114 | c471_outtok_iq4kr4 | 0.8539 |
| 115 | c241_bndall_iq3kt | 0.8539 |
| 116 | c191_faqkv_iq3kt | 0.8539 |
| 117 | c315_out_iq5ks | 0.8539 |
| 118 | c692_fa_iq4kt_bndffn_iq4kss | 0.8539 |
| 119 | c221_ryspath_iq3kt | 0.8539 |
| 120 | c251_bndffn_iq3kt | 0.8539 |
| 121 | c505_fa_iq4ks_rysffn_iq4ks | 0.8539 |
| 122 | c164_fa_iq4kss | 0.8539 |
| 123 | c170_fa_iq4kt | 0.8539 |
| 124 | c524_fa_iq4ks_finalffn_iq4kss | 0.8539 |
| 125 | c231_ryssig_iq3kt | 0.8539 |
| 126 | c271_lateattn_iq3kt | 0.8539 |
| 127 | c470_outtok_iq5ksr4 | 0.8539 |
| 128 | c259_rysffn_iq4xs | 0.8539 |
| 129 | c290_latectl_iq4kt | 0.8539 |
| 130 | c180_faqk_iq4kt | 0.8539 |
| 131 | c220_ryspath_iq4kt | 0.8539 |
| 132 | c250_bndffn_iq4kt | 0.8539 |
| 133 | c353_dupsig_iq3kt | 0.8539 |
| 134 | c189_faqkv_iq4xs | 0.8539 |
| 135 | c183_faqkv_iq4ks | 0.8539 |
| 136 | c233_bndall_iq4ks | 0.8539 |
| 137 | c280_lateffn_iq4kt | 0.8539 |
| 138 | c224_ryssig_iq4kss | 0.8539 |
| 139 | c230_ryssig_iq4kt | 0.8539 |
| 140 | c289_latectl_iq4xs | 0.8539 |
| 141 | c332_tok_mxfp4 | 0.8539 |
| 142 | c323_tok_iq4ks | 0.8539 |
| 143 | c182_faqk_mxfp4 | 0.8539 |
| 144 | c422_ryspath_iq4xsr8 | 0.8539 |
| 145 | c173_faqk_iq4ks | 0.8539 |
| 146 | c384_faqk_iq4ksr4 | 0.8539 |
| 147 | c418_ryspath_iq4ksr4 | 0.8539 |
| 148 | c358_srcpath_iq4kt | 0.8539 |
| 149 | c270_lateattn_iq4kt | 0.8539 |
| 150 | c243_bndffn_iq4ks | 0.8539 |
| 151 | c334_earlyattn_iq4kss | 0.8539 |
| 152 | c340_earlyattn_iq4kt | 0.8539 |
| 153 | c299_finalffn_iq4xs | 0.8539 |
| 154 | c229_ryssig_iq4xs | 0.8539 |
| 155 | c350_dupsig_iq4kss | 0.8539 |
| 156 | c435_ryssig_iq4ksr4 | 0.8539 |
| 157 | c269_lateattn_iq4xs | 0.8539 |
| 158 | c366_duppath_mxfp4 | 0.8539 |
| 159 | c333_earlyattn_iq4ks | 0.8539 |
| 160 | c196_fav_iq4k | 0.8539 |
| 161 | c354_dupsig_mxfp4 | 0.8539 |
| 162 | c349_dupsig_iq4ks | 0.8539 |
| 163 | c307_outtok_iq5k | 0.8539 |
| 164 | c481_outtok_q50r4 | 0.8539 |
| 165 | c484_outtok_q5kr4 | 0.8539 |
| 166 | c465_latectl_q40r8 | 0.8539 |
| 167 | c216_ryspath_iq4k | 0.8539 |
| 168 | c236_bndall_iq4k | 0.8539 |
| 169 | c176_faqk_iq4k | 0.8539 |
| 170 | c197_fav_iq5k | 0.8539 |
| 171 | c386_faqk_iq4kr4 | 0.8539 |
| 172 | c398_faqk_q4kr4 | 0.8539 |
| 173 | c440_ryssig_iq4nlr4 | 0.8539 |
| 174 | c449_ryssig_q4kr4 | 0.8539 |
| 175 | c318_out_iq6k | 0.8539 |
| 176 | c198_fav_iq6k | 0.8539 |
| 177 | c345_srcsig_iq5ks | 0.8539 |
| 178 | c335_earlyattn_iq5ks | 0.8539 |
| 179 | c627_fa_iq4k_lateffn_iq5ks | 0.8539 |
| 180 | c633_fa_iq4k_finalffn_iq5ks | 0.8539 |
| 181 | c225_ryssig_iq5ks | 0.8539 |
| 182 | c436_ryssig_iq5ksr4 | 0.8539 |
| 183 | c621_fa_iq4k_bndffn_iq5ks | 0.8539 |
| 184 | c338_earlyattn_iq6k | 0.8539 |
| 185 | c453_latectl_iq5ksr4 | 0.8539 |
| 186 | c385_faqk_iq5ksr4 | 0.8539 |
| 187 | c421_ryspath_iq5kr4 | 0.8539 |
| 188 | c187_faqkv_iq5k | 0.8539 |
| 189 | c387_faqk_iq5kr4 | 0.8539 |
| 190 | c327_tok_iq5k | 0.8539 |
| 191 | c368_fa_iq5ksr4 | 0.8539 |
| 192 | c237_bndall_iq5k | 0.8539 |
| 193 | c228_ryssig_iq6k | 0.8539 |
| 194 | c278_lateffn_iq6k | 0.8539 |
| 195 | c676_fa_iq4kt_outtok_q6k | 0.8539 |
| 196 | c539_fa_iq4kss_ryssig_q8 | 0.8539 |
| 197 | c428_ryspath_q60 | 0.8539 |
| 198 | c248_bndffn_iq6k | 0.8539 |
| 199 | c434_ryspath_q6kr4 | 0.8539 |
| 200 | c218_ryspath_iq6k | 0.8539 |
| 201 | c444_ryssig_q8kvr8 | 0.8539 |
| 202 | c463_latectl_q60r4 | 0.8539 |
| 203 | c395_faqk_q60r4 | 0.8539 |
| 204 | c480_outtok_q60r4 | 0.8539 |
| 205 | c443_ryssig_q8kr8 | 0.8539 |
| 206 | c411_faqkv_q60 | 0.8539 |
| 207 | c412_faqkv_q60r4 | 0.8539 |
| 208 | c288_latectl_iq6k | 0.8539 |
| 209 | c417_faqkv_q6kr4 | 0.8539 |
| 210 | c442_ryssig_q8r8 | 0.8539 |
| 211 | c308_outtok_iq6k | 0.8539 |
| 212 | c35_bndall_q6k | 0.8539 |
| 213 | c238_bndall_iq6k | 0.8539 |
| 214 | c427_ryspath_q8kvr8 | 0.8539 |
| 215 | c461_latectl_q8kvr8 | 0.8539 |
| 216 | c460_latectl_q8kr8 | 0.8539 |
| 217 | c688_fa_iq4kt_rysffn_q6k | 0.8539 |
| 218 | c599_fa_iq5ks_finalffn_q8 | 0.8539 |
| 219 | c409_faqkv_q8kr8 | 0.8539 |
| 220 | c408_faqkv_q8r8 | 0.8539 |
| 221 | c03_habit_boundary_q8 | 0.8539 |
| 222 | c45_lateattn_bf16 | 0.8539 |
| 223 | c478_outtok_q8kvr8 | 0.8539 |
| 224 | c580_fa_iq5ks_rysffn_q6k | 0.8539 |
| 225 | c477_outtok_q8kr8 | 0.8539 |
| 226 | c702_fa_iq4kt_lateffn_bf16 | 0.8539 |
| 227 | c476_outtok_q8r8 | 0.8539 |
| 228 | c689_fa_iq4kt_rysffn_q8 | 0.8539 |
| 229 | c89_fa_q8_ryssig_q8 | 0.8539 |
| 230 | c672_fa_iq5k_finalffn_bf16 | 0.8539 |
| 231 | c114_fao_bf16_faqk_q8 | 0.8539 |
| 232 | c15_faqk_bf16 | 0.8539 |
| 233 | c87_fa_q8_outtok_q8 | 0.8539 |
| 234 | c101_fa_bf16_ryssig_q8 | 0.8539 |
| 235 | c534_fa_iq4kss_outtok_bf16 | 0.8539 |
| 236 | c105_fa_bf16_bndffn_q8 | 0.8539 |
| 237 | c99_fa_bf16_outtok_q8 | 0.8539 |
| 238 | c102_fa_bf16_ryssig_bf16 | 0.8539 |
| 239 | c319_out_iq4xs | 0.8539 |
| 240 | c644_fa_iq5k_ryssig_iq4kss | 0.8539 |
| 241 | c142_fa_q8_outtok_q8_lateffn_q8 | 0.8539 |
| 242 | c148_fa_bf16_outtok_q8_lateffn_q8 | 0.8539 |
| 243 | c211_fao_iq3kt | 0.8524 |
| 244 | c466_latectl_q4kr4 | 0.8520 |
| 245 | c298_finalffn_iq6k | 0.8520 |
| 246 | c722_fa_iq3kt_rysffn_iq4kss | 0.8497 |
| 247 | c262_rysffn_mxfp4 | 0.8471 |
| 248 | c309_outtok_iq4xs | 0.8462 |
| 249 | c482_outtok_q40r8 | 0.8462 |
| 250 | c424_ryspath_iq3kr4 | 0.8462 |
| 251 | c302_finalffn_mxfp4 | 0.8462 |
| 252 | c282_lateffn_mxfp4 | 0.8462 |
| 253 | c232_ryssig_mxfp4 | 0.8462 |
| 254 | c381_fa_q4kr4 | 0.8462 |
| 255 | c195_fav_iq5ks | 0.8462 |
| 256 | c419_ryspath_iq5ksr4 | 0.8462 |
| 257 | c321_out_iq3kt | 0.8460 |
| 258 | c346_srcsig_iq4kt | 0.8460 |
| 259 | c295_finalffn_iq5ks | 0.8460 |
| 260 | c287_latectl_iq5k | 0.8452 |
| 261 | c591_fa_iq5ks_lateffn_iq5ks | 0.8452 |
| 262 | c393_faqk_q8kvr8 | 0.8452 |
| 263 | c475_outtok_iq3kr4 | 0.8443 |
| 264 | c529_fa_iq4kss_outtok_iq4ks | 0.8443 |
| 265 | c473_outtok_iq4xsr8 | 0.8443 |
| 266 | c303_outtok_iq4ks | 0.8443 |
| 267 | c313_out_iq4ks | 0.8443 |
| 268 | c316_out_iq4k | 0.8443 |
| 269 | c483_outtok_q4kr4 | 0.8443 |
| 270 | c741_fa_iq3kt_finalffn_iq5ks | 0.8443 |
| 271 | c331_tok_iq3kt | 0.8443 |
| 272 | c291_latectl_iq3kt | 0.8443 |
| 273 | c181_faqk_iq3kt | 0.8443 |
| 274 | c554_fa_iq4kss_lateffn_iq4kss | 0.8443 |
| 275 | c698_fa_iq4kt_lateffn_iq4kss | 0.8443 |
| 276 | c680_fa_iq4kt_ryssig_iq4kss | 0.8443 |
| 277 | c260_rysffn_iq4kt | 0.8443 |
| 278 | c559_fa_iq4kss_finalffn_iq4ks | 0.8443 |
| 279 | c301_finalffn_iq3kt | 0.8443 |
| 280 | c511_fa_iq4ks_bndffn_iq4ks | 0.8443 |
| 281 | c365_duppath_iq3kt | 0.8443 |
| 282 | c305_outtok_iq5ks | 0.8443 |
| 283 | c284_latectl_iq4kss | 0.8443 |
| 284 | c324_tok_iq4kss | 0.8443 |
| 285 | c341_earlyattn_iq3kt | 0.8443 |
| 286 | c367_fa_iq4ksr4 | 0.8443 |
| 287 | c347_srcsig_iq3kt | 0.8443 |
| 288 | c730_fa_iq3kt_bndffn_q6k | 0.8443 |
| 289 | c405_faqkv_iq4xsr8 | 0.8443 |
| 290 | c401_faqkv_iq4ksr4 | 0.8443 |
| 291 | c452_latectl_iq4ksr4 | 0.8443 |
| 292 | c179_faqk_iq4xs | 0.8443 |
| 293 | c362_duppath_iq4kss | 0.8443 |
| 294 | c249_bndffn_iq4xs | 0.8443 |
| 295 | c549_fa_iq4kss_bndffn_iq5ks | 0.8443 |
| 296 | c439_ryssig_iq4xsr8 | 0.8443 |
| 297 | c344_srcsig_iq4kss | 0.8443 |
| 298 | c352_dupsig_iq4kt | 0.8443 |
| 299 | c223_ryssig_iq4ks | 0.8443 |
| 300 | c272_lateattn_mxfp4 | 0.8443 |
| 301 | c263_lateattn_iq4ks | 0.8443 |
| 302 | c360_srcpath_mxfp4 | 0.8443 |
| 303 | c355_srcpath_iq4ks | 0.8443 |
| 304 | c361_duppath_iq4ks | 0.8443 |
| 305 | c212_fao_mxfp4 | 0.8443 |
| 306 | c342_earlyattn_mxfp4 | 0.8443 |
| 307 | c199_fav_iq4xs | 0.8443 |
| 308 | c372_fa_iq4nlr4 | 0.8443 |
| 309 | c380_fa_q40r8 | 0.8443 |
| 310 | c403_faqkv_iq4kr4 | 0.8443 |
| 311 | c343_srcsig_iq4ks | 0.8443 |
| 312 | c266_lateattn_iq4k | 0.8443 |
| 313 | c286_latectl_iq4k | 0.8443 |
| 314 | c420_ryspath_iq4kr4 | 0.8443 |
| 315 | c246_bndffn_iq4k | 0.8443 |
| 316 | c276_lateffn_iq4k | 0.8443 |
| 317 | c296_finalffn_iq4k | 0.8443 |
| 318 | c326_tok_iq4k | 0.8443 |
| 319 | c389_faqk_iq4nlr4 | 0.8443 |
| 320 | c437_ryssig_iq4kr4 | 0.8443 |
| 321 | c489_fav_q8kv | 0.8443 |
| 322 | c351_dupsig_iq5ks | 0.8443 |
| 323 | c265_lateattn_iq5ks | 0.8443 |
| 324 | c337_earlyattn_iq5k | 0.8443 |
| 325 | c363_duppath_iq5ks | 0.8443 |
| 326 | c275_lateffn_iq5ks | 0.8443 |
| 327 | c227_ryssig_iq5k | 0.8443 |
| 328 | c438_ryssig_iq5kr4 | 0.8443 |
| 329 | c584_fa_iq5ks_bndffn_iq4kss | 0.8443 |
| 330 | c215_ryspath_iq5ks | 0.8443 |
| 331 | c285_latectl_iq5ks | 0.8443 |
| 332 | c325_tok_iq5ks | 0.8443 |
| 333 | c247_bndffn_iq5k | 0.8443 |
| 334 | c268_lateattn_iq6k | 0.8443 |
| 335 | c177_faqk_iq5k | 0.8443 |
| 336 | c396_faqk_q50r4 | 0.8443 |
| 337 | c416_faqkv_q5kr4 | 0.8443 |
| 338 | c445_ryssig_q60 | 0.8443 |
| 339 | c446_ryssig_q60r4 | 0.8443 |
| 340 | c451_ryssig_q6kr4 | 0.8443 |
| 341 | c394_faqk_q60 | 0.8443 |
| 342 | c479_outtok_q60 | 0.8443 |
| 343 | c485_outtok_q6kr4 | 0.8443 |
| 344 | c328_tok_iq6k | 0.8443 |
| 345 | c188_faqkv_iq6k | 0.8443 |
| 346 | c426_ryspath_q8kr8 | 0.8443 |
| 347 | c575_fa_iq5ks_ryssig_q8 | 0.8443 |
| 348 | c392_faqk_q8kr8 | 0.8443 |
| 349 | c459_latectl_q8r8 | 0.8443 |
| 350 | c738_fa_iq3kt_lateffn_bf16 | 0.8443 |
| 351 | c63_fa_q5k_outtok_q8 | 0.8443 |
| 352 | c09_union_rys_path_q8_late_control_q8 | 0.8443 |
| 353 | c714_fa_iq3kt_outtok_bf16 | 0.8443 |
| 354 | c690_fa_iq4kt_rysffn_bf16 | 0.8443 |
| 355 | c582_fa_iq5ks_rysffn_bf16 | 0.8443 |
| 356 | c757_fa_mxfp4_rysffn_iq4ks | 0.8436 |
| 357 | c310_outtok_iq4kt | 0.8433 |
| 358 | c359_srcpath_iq3kt | 0.8433 |
| 359 | c441_ryssig_iq3kr4 | 0.8433 |
| 360 | c256_rysffn_iq4k | 0.8433 |
| 361 | c235_bndall_iq5ks | 0.8433 |
| 362 | c589_fa_iq5ks_lateffn_iq4ks | 0.8433 |
| 363 | c661_fa_iq5k_lateffn_iq4ks | 0.8433 |
| 364 | c258_rysffn_iq6k | 0.8433 |
| 365 | c283_latectl_iq4ks | 0.8424 |
| 366 | c356_srcpath_iq4kss | 0.8424 |
| 367 | c414_faqkv_q40r8 | 0.8424 |
| 368 | c468_latectl_q6kr4 | 0.8424 |
| 369 | c112_fav_bf16_fao_bf16 | 0.8424 |
| 370 | c264_lateattn_iq4kss | 0.8416 |
| 371 | c69_fa_q5k_bndffn_q8 | 0.8407 |
| 372 | c430_ryspath_q50r4 | 0.8404 |
| 373 | c320_out_iq4kt | 0.8390 |
| 374 | c425_ryspath_q8r8 | 0.8375 |
| 375 | c588_fa_iq5ks_bndffn_bf16 | 0.8365 |
| 376 | c96_fa_q8_lateffn_bf16 | 0.8365 |
| 377 | c255_rysffn_iq5ks | 0.8363 |
| 378 | c455_latectl_iq5kr4 | 0.8356 |
| 379 | c390_faqk_iq3kr4 | 0.8346 |
| 380 | c447_ryssig_q50r4 | 0.8344 |
| 381 | c469_outtok_iq4ksr4 | 0.8337 |
| 382 | c294_finalffn_iq4kss | 0.8337 |
| 383 | c292_latectl_mxfp4 | 0.8337 |
| 384 | c194_fav_iq4kss | 0.8337 |
| 385 | c406_faqkv_iq4nlr4 | 0.8337 |
| 386 | c707_fa_iq4kt_finalffn_q8 | 0.8337 |
| 387 | c462_latectl_q60 | 0.8337 |
| 388 | c240_bndall_iq4kt | 0.8337 |
| 389 | c432_ryspath_q4kr4 | 0.8337 |
| 390 | c448_ryssig_q40r8 | 0.8337 |
| 391 | c399_faqk_q5kr4 | 0.8337 |
| 392 | c490_ryspath_q8kv | 0.8337 |
| 393 | c410_faqkv_q8kvr8 | 0.8337 |
| 394 | c126_bndall_bf16_lateffn_q8 | 0.8337 |
| 395 | c257_rysffn_iq5k | 0.8336 |
| 396 | c710_fa_iq3kt_outtok_iq4kss | 0.8322 |
| 397 | c400_faqk_q6kr4 | 0.8298 |
| 398 | c261_rysffn_iq3kt | 0.8281 |
| 399 | c388_faqk_iq4xsr8 | 0.8281 |
| 400 | c202_fav_mxfp4 | 0.8276 |
| 401 | c83_fa_q6k_lateffn_q8 | 0.8268 |
| 402 | c293_finalffn_iq4ks | 0.8241 |
| 403 | c694_fa_iq4kt_bndffn_q6k | 0.8241 |
| 404 | c745_fa_mxfp4_outtok_iq4ks | 0.8234 |
| 405 | c517_fa_iq4ks_lateffn_iq4ks | 0.8163 |
| 406 | c163_fa_iq4ks | 0.8163 |
| 407 | c186_faqkv_iq4k | 0.8163 |
| 408 | c454_latectl_iq4kr4 | 0.8163 |
| 409 | c457_latectl_iq4nlr4 | 0.8163 |
| 410 | c297_finalffn_iq5k | 0.8163 |
| 411 | c175_faqk_iq5ks | 0.8163 |
| 412 | c217_ryspath_iq5k | 0.8163 |
| 413 | c585_fa_iq5ks_bndffn_iq5ks | 0.8163 |
| 414 | c586_fa_iq5ks_bndffn_q6k | 0.8163 |
| 415 | c168_fa_iq6k | 0.8163 |
| 416 | c684_fa_iq4kt_ryssig_bf16 | 0.8163 |
| 417 | c374_fa_q8r8 | 0.8163 |
| 418 | c522_fa_iq4ks_lateffn_bf16 | 0.8163 |
| 419 | c594_fa_iq5ks_lateffn_bf16 | 0.8163 |
| 420 | c371_fa_iq4xsr8 | 0.8135 |
| 421 | c245_bndffn_iq5ks | 0.8135 |
| 422 | c165_fa_iq5ks | 0.8086 |
| 423 | c304_outtok_iq4kss | 0.8066 |
| 424 | c415_faqkv_q4kr4 | 0.8066 |
| 425 | c357_srcpath_iq5ks | 0.8066 |
| 426 | c185_faqkv_iq5ks | 0.8066 |
| 427 | c402_faqkv_iq5ksr4 | 0.8066 |
| 428 | c726_fa_iq3kt_rysffn_bf16 | 0.8066 |
| 429 | c314_out_iq4kss | 0.7961 |
| 430 | c458_latectl_iq3kr4 | 0.7960 |
| 431 | c487_faqk_q8kv | 0.7960 |
| 432 | c222_ryspath_mxfp4 | 0.7858 |
| 433 | c376_fa_q8kvr8 | 0.7451 |
| 434 | c378_fa_q60r4 | 0.6776 |
| 435 | c383_fa_q6kr4 | 0.6776 |
| 436 | c486_fa_q8kv | 0.5301 |
| 437 | c375_fa_q8kr8 | 0.5266 |
| 438 | c687_fa_iq4kt_rysffn_iq5ks | 0.3271 |
| 439 | c488_faqkv_q8kv | 0.0000 |
| 440 | c581_fa_iq5ks_rysffn_q8 | 0.0000 |

All stable promotion-screen candidates by weighted score

| Rank | Candidate | Weighted score |
| --- | --- | --- |
| 1 | c76_fa_q6k_outtok_bf16 | 0.8413 |
| 2 | c21_fao_q6k | 0.8385 |
| 3 | c22_fao_q8 | 0.8385 |
| 4 | c69_fa_q5k_bndffn_q8 | 0.8385 |
| 5 | c23_fao_bf16 | 0.8385 |
| 6 | c112_fav_bf16_fao_bf16 | 0.8385 |
| 7 | c89_fa_q8_ryssig_q8 | 0.8385 |
| 8 | c114_fao_bf16_faqk_q8 | 0.8385 |
| 9 | c93_fa_q8_bndffn_q8 | 0.8385 |
| 10 | c152_fav_bf16_fao_bf16_bndffn_q8 | 0.8385 |
| 11 | c154_fao_bf16_faqk_q8_lateffn_q8 | 0.8385 |
| 12 | c87_fa_q8_outtok_q8 | 0.8385 |
| 13 | c96_fa_q8_lateffn_bf16 | 0.8385 |
| 14 | c142_fa_q8_outtok_q8_lateffn_q8 | 0.8385 |
| 15 | c126_bndall_bf16_lateffn_q8 | 0.8385 |
| 16 | c05_all_fullattn_bf16 | 0.8385 |
| 17 | c101_fa_bf16_ryssig_q8 | 0.8385 |
| 18 | c56_outtok_bf16 | 0.8385 |
| 19 | c64_fa_q5k_outtok_bf16 | 0.8385 |
| 20 | c99_fa_bf16_outtok_q8 | 0.8385 |
| 21 | c102_fa_bf16_ryssig_bf16 | 0.8385 |
| 22 | c148_fa_bf16_outtok_q8_lateffn_q8 | 0.8385 |
| 23 | c147_fa_bf16_outtok_q8_bndffn_q8 | 0.8385 |
| 24 | c88_fa_q8_outtok_bf16 | 0.8385 |
| 25 | c100_fa_bf16_outtok_bf16 | 0.8385 |
| 26 | c06_output_token_q8_fullattn_q8 | 0.8357 |
| 27 | c106_fa_bf16_bndffn_bf16 | 0.8333 |
| 28 | c35_bndall_q6k | 0.8243 |
| 29 | c111_fav_bf16_fao_q8 | 0.8243 |
| 30 | c03_habit_boundary_q8 | 0.8243 |
| 31 | c45_lateattn_bf16 | 0.8243 |
| 32 | c77_fa_q6k_ryssig_q8 | 0.8243 |
| 33 | c83_fa_q6k_lateffn_q8 | 0.8243 |
| 34 | c63_fa_q5k_outtok_q8 | 0.8243 |
| 35 | c15_faqk_bf16 | 0.8243 |
| 36 | c151_fav_bf16_fao_bf16_outtok_q8 | 0.8243 |
| 37 | c141_fa_q8_outtok_q8_bndffn_q8 | 0.8243 |
| 38 | c105_fa_bf16_bndffn_q8 | 0.8243 |
| 39 | c145_fa_bf16_outtok_q8_ryssig_q8 | 0.8243 |
| 40 | c01_rys_window_path_q8 | 0.8174 |
| 41 | c09_union_rys_path_q8_late_control_q8 | 0.8174 |
| 42 | c18_fav_q6k | 0.8146 |
| 43 | c20_fav_bf16 | 0.8146 |
| 44 | c27_srcsig_q8 | 0.8146 |
| 45 | c46_lateffn_q6k | 0.8146 |
| 46 | c61_earlyattn_q8 | 0.8146 |
| 47 | c37_bndffn_q6k | 0.8146 |
| 48 | c31_srcpath_q8 | 0.8146 |
| 49 | c33_duppath_q8 | 0.8146 |
| 50 | c57_out_q8 | 0.8146 |
| 51 | c49_latectl_q6k | 0.8146 |
| 52 | c13_faqk_q6k | 0.8146 |
| 53 | c54_outtok_q6k | 0.8146 |
| 54 | c25_ryssig_q8 | 0.8146 |
| 55 | c47_lateffn_q8 | 0.8146 |
| 56 | c52_finalffn_q8 | 0.8146 |
| 57 | c12_fa_q6k | 0.8146 |
| 58 | c28_srcsig_bf16 | 0.8146 |
| 59 | c65_fa_q5k_ryssig_q8 | 0.8146 |
| 60 | c62_earlyattn_bf16 | 0.8146 |
| 61 | c14_faqk_q8 | 0.8146 |
| 62 | c73_fa_q5k_finalffn_q8 | 0.8146 |
| 63 | c16_faqkv_q8 | 0.8146 |
| 64 | c116_srcsig_q8_dupsig_bf16 | 0.8146 |
| 65 | c113_fav_bf16_faqk_q8 | 0.8146 |
| 66 | c32_srcpath_bf16 | 0.8146 |
| 67 | c85_fa_q6k_finalffn_q8 | 0.8146 |
| 68 | c67_fa_q5k_rysffn_q6k | 0.8146 |
| 69 | c131_fa_q5k_ryssig_q8_lateffn_q8 | 0.8146 |
| 70 | c02_all_fullattn_q8 | 0.8146 |
| 71 | c55_outtok_q8 | 0.8146 |
| 72 | c26_ryssig_bf16 | 0.8146 |
| 73 | c81_fa_q6k_bndffn_q8 | 0.8146 |
| 74 | c48_lateffn_bf16 | 0.8146 |
| 75 | c156_srcsig_bf16_dupsig_q8_lateffn_q8 | 0.8146 |
| 76 | c117_srcpath_bf16_duppath_q8 | 0.8146 |
| 77 | c118_srcpath_q8_duppath_bf16 | 0.8146 |
| 78 | c125_bndall_q8_lateffn_q8 | 0.8146 |
| 79 | c121_lateattn_bf16_lateffn_q8 | 0.8146 |
| 80 | c53_finalffn_bf16 | 0.8146 |
| 81 | c79_fa_q6k_rysffn_q6k | 0.8146 |
| 82 | c137_fa_q6k_ryssig_q8_lateffn_q8 | 0.8146 |
| 83 | c72_fa_q5k_lateffn_bf16 | 0.8146 |
| 84 | c122_lateattn_q8_lateffn_bf16 | 0.8146 |
| 85 | c95_fa_q8_lateffn_q8 | 0.8146 |
| 86 | c97_fa_q8_finalffn_q8 | 0.8146 |
| 87 | c41_rysffn_q8 | 0.8146 |
| 88 | c138_fa_q6k_bndffn_q8_lateffn_q8 | 0.8146 |
| 89 | c74_fa_q5k_finalffn_bf16 | 0.8146 |
| 90 | c75_fa_q6k_outtok_q8 | 0.8146 |
| 91 | c78_fa_q6k_ryssig_bf16 | 0.8146 |
| 92 | c84_fa_q6k_lateffn_bf16 | 0.8146 |
| 93 | c58_out_bf16 | 0.8146 |
| 94 | c127_fa_q5k_outtok_q8_ryssig_q8 | 0.8146 |
| 95 | c39_bndffn_bf16 | 0.8146 |
| 96 | c123_latectl_q8_outtok_q8 | 0.8146 |
| 97 | c68_fa_q5k_rysffn_q8 | 0.8146 |
| 98 | c04_rys_window_path_bf16 | 0.8146 |
| 99 | c91_fa_q8_rysffn_q6k | 0.8146 |
| 100 | c143_fa_q8_ryssig_q8_lateffn_q8 | 0.8146 |
| 101 | c153_fav_bf16_faqk_q8_outtok_q8 | 0.8146 |
| 102 | c136_fa_q6k_outtok_q8_lateffn_q8 | 0.8146 |
| 103 | c144_fa_q8_bndffn_q8_lateffn_q8 | 0.8146 |
| 104 | c17_faqkv_bf16 | 0.8146 |
| 105 | c07_rys_window_all_q8 | 0.8146 |
| 106 | c90_fa_q8_ryssig_bf16 | 0.8146 |
| 107 | c135_fa_q6k_outtok_q8_bndffn_q8 | 0.8146 |
| 108 | c82_fa_q6k_bndffn_bf16 | 0.8146 |
| 109 | c98_fa_q8_finalffn_bf16 | 0.8146 |
| 110 | c36_bndall_bf16 | 0.8146 |
| 111 | c10_union_rys_path_bf16_late_control_q8 | 0.8146 |
| 112 | c92_fa_q8_rysffn_q8 | 0.8146 |
| 113 | c94_fa_q8_bndffn_bf16 | 0.8146 |
| 114 | c128_fa_q5k_outtok_q8_rysffn_q8 | 0.8146 |
| 115 | c134_fa_q6k_outtok_q8_rysffn_q8 | 0.8146 |
| 116 | c107_fa_bf16_lateffn_q8 | 0.8146 |
| 117 | c109_fa_bf16_finalffn_q8 | 0.8146 |
| 118 | c103_fa_bf16_rysffn_q6k | 0.8146 |
| 119 | c150_fa_bf16_bndffn_q8_lateffn_q8 | 0.8146 |
| 120 | c108_fa_bf16_lateffn_bf16 | 0.8146 |
| 121 | c42_rysffn_bf16 | 0.8146 |
| 122 | c110_fa_bf16_finalffn_bf16 | 0.8146 |
| 123 | c104_fa_bf16_rysffn_q8 | 0.8146 |
| 124 | c146_fa_bf16_outtok_q8_rysffn_q8 | 0.8146 |
| 125 | c51_finalffn_q6k | 0.8117 |
| 126 | c86_fa_q6k_finalffn_bf16 | 0.8032 |
| 127 | c140_fa_q8_outtok_q8_rysffn_q8 | 0.8032 |
| 128 | c40_rysffn_q6k | 0.8031 |
| 129 | c19_fav_q8 | 0.8004 |
| 130 | c43_lateattn_q6k | 0.8004 |
| 131 | c29_dupsig_q8 | 0.8004 |
| 132 | c24_ryssig_q6k | 0.8004 |
| 133 | c11_fa_q5k | 0.8004 |
| 134 | c44_lateattn_q8 | 0.8004 |
| 135 | c30_dupsig_bf16 | 0.8004 |
| 136 | c38_bndffn_q8 | 0.8004 |
| 137 | c71_fa_q5k_lateffn_q8 | 0.8004 |
| 138 | c08_late_control_q8 | 0.8004 |
| 139 | c59_tok_q8 | 0.8004 |
| 140 | c115_srcsig_bf16_dupsig_q8 | 0.8004 |
| 141 | c34_duppath_bf16 | 0.8004 |
| 142 | c132_fa_q5k_bndffn_q8_lateffn_q8 | 0.8004 |
| 143 | c66_fa_q5k_ryssig_bf16 | 0.8004 |
| 144 | c130_fa_q5k_outtok_q8_lateffn_q8 | 0.8004 |
| 145 | c155_srcsig_bf16_dupsig_q8_outtok_q8 | 0.8004 |
| 146 | c129_fa_q5k_outtok_q8_bndffn_q8 | 0.8004 |
| 147 | c133_fa_q6k_outtok_q8_ryssig_q8 | 0.8004 |
| 148 | c50_latectl_bf16 | 0.8004 |
| 149 | c80_fa_q6k_rysffn_q8 | 0.8004 |
| 150 | c60_tok_bf16 | 0.8004 |
| 151 | c160_ryspath_q8_outtok_q8_lateffn_q8 | 0.8004 |
| 152 | c157_srcpath_bf16_duppath_q8_outtok_q8 | 0.8004 |
| 153 | c161_lateattn_bf16_lateffn_q8_outtok_q8 | 0.8004 |
| 154 | c162_latectl_q8_outtok_q8_bndffn_q8 | 0.8004 |
| 155 | c139_fa_q8_outtok_q8_ryssig_q8 | 0.8004 |
| 156 | c120_ryssig_bf16_rysffn_q8 | 0.8004 |
| 157 | c124_latectl_bf16_outtok_q8 | 0.8004 |
| 158 | c159_ryspath_bf16_outtok_q8_lateffn_q8 | 0.8004 |
| 159 | c149_fa_bf16_ryssig_q8_lateffn_q8 | 0.8004 |
| 160 | release_s010_iq4nl | 0.7954 |
| 161 | c70_fa_q5k_bndffn_bf16 | 0.7952 |
| 162 | c158_ryssig_bf16_rysffn_q8_lateffn_q8 | 0.7906 |
| 163 | c119_ryssig_q8_rysffn_q8 | 0.7889 |

Habit score vs hard score

c373_fa_iq3kr4: habit score 0.8786, hard score 0.913
c281_lateffn_iq3kt: habit score 0.8786, hard score 0.9085
c718_fa_iq3kt_ryssig_q6k: habit score 0.9286, hard score 0.8219
c204_fao_iq4kss: habit score 0.9143, hard score 0.8263
c311_outtok_iq3kt: habit score 0.9143, hard score 0.8219
c317_out_iq5k: habit score 0.9143, hard score 0.8219
c254_rysffn_iq4kss: habit score 0.9143, hard score 0.8219
c500_fa_iq4ks_ryssig_iq4kss: habit score 0.9143, hard score 0.8219
c234_bndall_iq4kss: habit score 0.9143, hard score 0.8219
c253_rysffn_iq4ks: habit score 0.9143, hard score 0.8219
c169_fa_iq4xs: habit score 0.9143, hard score 0.8219
c274_lateffn_iq4kss: habit score 0.9143, hard score 0.8219
c456_latectl_iq4xsr8: habit score 0.9143, hard score 0.8219
c213_ryspath_iq4ks: habit score 0.9143, hard score 0.8219
c364_duppath_iq4kt: habit score 0.9143, hard score 0.8219
c201_fav_iq3kt: habit score 0.9143, hard score 0.8219
c273_lateffn_iq4ks: habit score 0.9143, hard score 0.8219
c200_fav_iq4kt: habit score 0.9143, hard score 0.8219
c339_earlyattn_iq4xs: habit score 0.9143, hard score 0.8219
c193_fav_iq4ks: habit score 0.9143, hard score 0.8219
c369_fa_iq4kr4: habit score 0.9143, hard score 0.8219
c336_earlyattn_iq4k: habit score 0.9143, hard score 0.8219
c423_ryspath_iq4nlr4: habit score 0.9143, hard score 0.8219
c431_ryspath_q40r8: habit score 0.9143, hard score 0.8219
c206_fao_iq4k: habit score 0.9143, hard score 0.8219
c226_ryssig_iq4k: habit score 0.9143, hard score 0.8219
c397_faqk_q40r8: habit score 0.9143, hard score 0.8219
c205_fao_iq5ks: habit score 0.9143, hard score 0.8219
c433_ryspath_q5kr4: habit score 0.9143, hard score 0.8219
c649_fa_iq5k_rysffn_iq4ks: habit score 0.9143, hard score 0.8219
c21_fao_q6k: habit score 0.9143, hard score 0.8219
c208_fao_iq6k: habit score 0.9143, hard score 0.8219
c404_faqkv_iq5kr4: habit score 0.9143, hard score 0.8219
c610_fa_iq4k_ryssig_q6k: habit score 0.9143, hard score 0.8219
c655_fa_iq5k_bndffn_iq4ks: habit score 0.9143, hard score 0.8219
c167_fa_iq5k: habit score 0.9143, hard score 0.8219
c370_fa_iq5kr4: habit score 0.9143, hard score 0.8219
c379_fa_q50r4: habit score 0.9143, hard score 0.8219
c22_fao_q8: habit score 0.9143, hard score 0.8219
c391_faqk_q8r8: habit score 0.9143, hard score 0.8219
c616_fa_iq4k_rysffn_q6k: habit score 0.9143, hard score 0.8219
c587_fa_iq5ks_bndffn_q8: habit score 0.9143, hard score 0.8219
c77_fa_q6k_ryssig_q8: habit score 0.9143, hard score 0.8219
c666_fa_iq5k_lateffn_bf16: habit score 0.9143, hard score 0.8219
c93_fa_q8_bndffn_q8: habit score 0.9143, hard score 0.8219
c154_fao_bf16_faqk_q8_lateffn_q8: habit score 0.9143, hard score 0.8219
c06_output_token_q8_fullattn_q8: habit score 0.9143, hard score 0.8219
c05_all_fullattn_bf16: habit score 0.9143, hard score 0.8219
c56_outtok_bf16: habit score 0.9143, hard score 0.8219
c145_fa_bf16_outtok_q8_ryssig_q8: habit score 0.9143, hard score 0.8219
c147_fa_bf16_outtok_q8_bndffn_q8: habit score 0.9143, hard score 0.8219
c106_fa_bf16_bndffn_bf16: habit score 0.9143, hard score 0.8219
c88_fa_q8_outtok_bf16: habit score 0.9143, hard score 0.8219
c100_fa_bf16_outtok_bf16: habit score 0.9143, hard score 0.8219
c172_fa_mxfp4: habit score 0.9143, hard score 0.8202
c192_faqkv_mxfp4: habit score 0.9143, hard score 0.8202
c239_bndall_iq4xs: habit score 0.9143, hard score 0.8174
c377_fa_q60: habit score 0.9143, hard score 0.8174
c413_faqkv_q50r4: habit score 0.9143, hard score 0.8157
c207_fao_iq5k: habit score 0.9143, hard score 0.8085
c312_outtok_mxfp4: habit score 0.8786, hard score 0.8537
c300_finalffn_iq4kt: habit score 0.9143, hard score 0.8041
c474_outtok_iq4nlr4: habit score 0.9143, hard score 0.7996
c691_fa_iq4kt_bndffn_iq4ks: habit score 0.9143, hard score 0.7996
c184_faqkv_iq4kss: habit score 0.9143, hard score 0.7996
c330_tok_iq4kt: habit score 0.9143, hard score 0.7996
c174_faqk_iq4kss: habit score 0.9143, hard score 0.7996
c244_bndffn_iq4kss: habit score 0.9143, hard score 0.7996
c329_tok_iq4xs: habit score 0.9143, hard score 0.7996
c219_ryspath_iq4xs: habit score 0.9143, hard score 0.7996
c210_fao_iq4kt: habit score 0.9143, hard score 0.7996
c252_bndffn_mxfp4: habit score 0.9143, hard score 0.7996
c279_lateffn_iq4xs: habit score 0.9143, hard score 0.7996
c209_fao_iq4xs: habit score 0.9143, hard score 0.7996
c203_fao_iq4ks: habit score 0.9143, hard score 0.7996
c166_fa_iq4k: habit score 0.9143, hard score 0.7996
c472_outtok_iq5kr4: habit score 0.9143, hard score 0.7996
c267_lateattn_iq5k: habit score 0.9143, hard score 0.7996
c583_fa_iq5ks_bndffn_iq4ks: habit score 0.9143, hard score 0.7996
c639_fa_iq5k_outtok_iq5ks: habit score 0.9143, hard score 0.7996
c464_latectl_q50r4: habit score 0.9143, hard score 0.7996
c467_latectl_q5kr4: habit score 0.9143, hard score 0.7996
c382_fa_q5kr4: habit score 0.9143, hard score 0.7996
c491_ryssig_q8kv: habit score 0.9143, hard score 0.7996
c604_fa_iq4k_outtok_q6k: habit score 0.9143, hard score 0.7996
c178_faqk_iq6k: habit score 0.9143, hard score 0.7996
c111_fav_bf16_fao_q8: habit score 0.9143, hard score 0.7996
c544_fa_iq4kss_rysffn_q6k: habit score 0.9143, hard score 0.7996
c23_fao_bf16: habit score 0.9143, hard score 0.7996
c564_fa_iq4kss_finalffn_bf16: habit score 0.9143, hard score 0.7996
c569_fa_iq5ks_outtok_q8: habit score 0.9143, hard score 0.7996
c152_fav_bf16_fao_bf16_bndffn_q8: habit score 0.9143, hard score 0.7996
c151_fav_bf16_fao_bf16_outtok_q8: habit score 0.9143, hard score 0.7996
c141_fa_q8_outtok_q8_bndffn_q8: habit score 0.9143, hard score 0.7996
c64_fa_q5k_outtok_bf16: habit score 0.9143, hard score 0.7996
c76_fa_q6k_outtok_bf16: habit score 0.9143, hard score 0.7996
c751_fa_mxfp4_ryssig_iq4ks: habit score 0.9143, hard score 0.798
c592_fa_iq5ks_lateffn_q6k: habit score 0.9143, hard score 0.7974
c322_out_mxfp4: habit score 0.8786, hard score 0.8315
c407_faqkv_iq3kr4: habit score 0.8786, hard score 0.83
c214_ryspath_iq4kss: habit score 0.8786, hard score 0.83
c190_faqkv_iq4kt: habit score 0.8786, hard score 0.8263
c242_bndall_mxfp4: habit score 0.8786, hard score 0.8263
c348_srcsig_mxfp4: habit score 0.8786, hard score 0.8263
c450_ryssig_q5kr4: habit score 0.8786, hard score 0.8263
c277_lateffn_iq5k: habit score 0.8786, hard score 0.8263
c590_fa_iq5ks_lateffn_iq4kss: habit score 0.8786, hard score 0.8263
c492_lateattn_q8kv: habit score 0.8786, hard score 0.8263
c429_ryspath_q60r4: habit score 0.8786, hard score 0.8263
c494_fa_iq4ks_outtok_iq4kss: habit score 0.8786, hard score 0.8219
c734_fa_iq3kt_lateffn_iq4kss: habit score 0.8786, hard score 0.8219
c171_fa_iq3kt: habit score 0.8786, hard score 0.8219
c306_outtok_iq4k: habit score 0.8786, hard score 0.8219
c471_outtok_iq4kr4: habit score 0.8786, hard score 0.8219
c241_bndall_iq3kt: habit score 0.8786, hard score 0.8219
c191_faqkv_iq3kt: habit score 0.8786, hard score 0.8219
c315_out_iq5ks: habit score 0.8786, hard score 0.8219
c692_fa_iq4kt_bndffn_iq4kss: habit score 0.8786, hard score 0.8219
c221_ryspath_iq3kt: habit score 0.8786, hard score 0.8219
c251_bndffn_iq3kt: habit score 0.8786, hard score 0.8219
c505_fa_iq4ks_rysffn_iq4ks: habit score 0.8786, hard score 0.8219
c164_fa_iq4kss: habit score 0.8786, hard score 0.8219
c170_fa_iq4kt: habit score 0.8786, hard score 0.8219
c524_fa_iq4ks_finalffn_iq4kss: habit score 0.8786, hard score 0.8219
c231_ryssig_iq3kt: habit score 0.8786, hard score 0.8219
c271_lateattn_iq3kt: habit score 0.8786, hard score 0.8219
c470_outtok_iq5ksr4: habit score 0.8786, hard score 0.8219
c259_rysffn_iq4xs: habit score 0.8786, hard score 0.8219
c290_latectl_iq4kt: habit score 0.8786, hard score 0.8219
c180_faqk_iq4kt: habit score 0.8786, hard score 0.8219
c220_ryspath_iq4kt: habit score 0.8786, hard score 0.8219
c250_bndffn_iq4kt: habit score 0.8786, hard score 0.8219
c353_dupsig_iq3kt: habit score 0.8786, hard score 0.8219
c189_faqkv_iq4xs: habit score 0.8786, hard score 0.8219
c183_faqkv_iq4ks: habit score 0.8786, hard score 0.8219
c233_bndall_iq4ks: habit score 0.8786, hard score 0.8219
c280_lateffn_iq4kt: habit score 0.8786, hard score 0.8219
c224_ryssig_iq4kss: habit score 0.8786, hard score 0.8219
c230_ryssig_iq4kt: habit score 0.8786, hard score 0.8219
c289_latectl_iq4xs: habit score 0.8786, hard score 0.8219
c332_tok_mxfp4: habit score 0.8786, hard score 0.8219
c323_tok_iq4ks: habit score 0.8786, hard score 0.8219
c182_faqk_mxfp4: habit score 0.8786, hard score 0.8219
c422_ryspath_iq4xsr8: habit score 0.8786, hard score 0.8219
c173_faqk_iq4ks: habit score 0.8786, hard score 0.8219
c384_faqk_iq4ksr4: habit score 0.8786, hard score 0.8219
c418_ryspath_iq4ksr4: habit score 0.8786, hard score 0.8219
c358_srcpath_iq4kt: habit score 0.8786, hard score 0.8219
c270_lateattn_iq4kt: habit score 0.8786, hard score 0.8219
c243_bndffn_iq4ks: habit score 0.8786, hard score 0.8219
c334_earlyattn_iq4kss: habit score 0.8786, hard score 0.8219
c340_earlyattn_iq4kt: habit score 0.8786, hard score 0.8219
c299_finalffn_iq4xs: habit score 0.8786, hard score 0.8219
c229_ryssig_iq4xs: habit score 0.8786, hard score 0.8219
c350_dupsig_iq4kss: habit score 0.8786, hard score 0.8219
c435_ryssig_iq4ksr4: habit score 0.8786, hard score 0.8219
c269_lateattn_iq4xs: habit score 0.8786, hard score 0.8219
c366_duppath_mxfp4: habit score 0.8786, hard score 0.8219
c333_earlyattn_iq4ks: habit score 0.8786, hard score 0.8219
c196_fav_iq4k: habit score 0.8786, hard score 0.8219
c354_dupsig_mxfp4: habit score 0.8786, hard score 0.8219
c349_dupsig_iq4ks: habit score 0.8786, hard score 0.8219
c307_outtok_iq5k: habit score 0.8786, hard score 0.8219
c481_outtok_q50r4: habit score 0.8786, hard score 0.8219
c484_outtok_q5kr4: habit score 0.8786, hard score 0.8219
c465_latectl_q40r8: habit score 0.8786, hard score 0.8219
c216_ryspath_iq4k: habit score 0.8786, hard score 0.8219
c236_bndall_iq4k: habit score 0.8786, hard score 0.8219
c176_faqk_iq4k: habit score 0.8786, hard score 0.8219
c197_fav_iq5k: habit score 0.8786, hard score 0.8219
c386_faqk_iq4kr4: habit 
score 0.8786, hard score 0.8219 c398_faqk_q4kr4: habit score 0.8786, hard score 0.8219 c440_ryssig_iq4nlr4: habit score 0.8786, hard score 0.8219 c449_ryssig_q4kr4: habit score 0.8786, hard score 0.8219 c318_out_iq6k: habit score 0.8786, hard score 0.8219 c198_fav_iq6k: habit score 0.8786, hard score 0.8219 c345_srcsig_iq5ks: habit score 0.8786, hard score 0.8219 c335_earlyattn_iq5ks: habit score 0.8786, hard score 0.8219 c627_fa_iq4k_lateffn_iq5ks: habit score 0.8786, hard score 0.8219 c633_fa_iq4k_finalffn_iq5ks: habit score 0.8786, hard score 0.8219 c225_ryssig_iq5ks: habit score 0.8786, hard score 0.8219 c436_ryssig_iq5ksr4: habit score 0.8786, hard score 0.8219 c621_fa_iq4k_bndffn_iq5ks: habit score 0.8786, hard score 0.8219 c338_earlyattn_iq6k: habit score 0.8786, hard score 0.8219 c453_latectl_iq5ksr4: habit score 0.8786, hard score 0.8219 c385_faqk_iq5ksr4: habit score 0.8786, hard score 0.8219 c421_ryspath_iq5kr4: habit score 0.8786, hard score 0.8219 c187_faqkv_iq5k: habit score 0.8786, hard score 0.8219 c387_faqk_iq5kr4: habit score 0.8786, hard score 0.8219 c327_tok_iq5k: habit score 0.8786, hard score 0.8219 c368_fa_iq5ksr4: habit score 0.8786, hard score 0.8219 c237_bndall_iq5k: habit score 0.8786, hard score 0.8219 c228_ryssig_iq6k: habit score 0.8786, hard score 0.8219 c278_lateffn_iq6k: habit score 0.8786, hard score 0.8219 c676_fa_iq4kt_outtok_q6k: habit score 0.8786, hard score 0.8219 c539_fa_iq4kss_ryssig_q8: habit score 0.8786, hard score 0.8219 c428_ryspath_q60: habit score 0.8786, hard score 0.8219 c248_bndffn_iq6k: habit score 0.8786, hard score 0.8219 c434_ryspath_q6kr4: habit score 0.8786, hard score 0.8219 c218_ryspath_iq6k: habit score 0.8786, hard score 0.8219 c444_ryssig_q8kvr8: habit score 0.8786, hard score 0.8219 c463_latectl_q60r4: habit score 0.8786, hard score 0.8219 c395_faqk_q60r4: habit score 0.8786, hard score 0.8219 c480_outtok_q60r4: habit score 0.8786, hard score 0.8219 c443_ryssig_q8kr8: habit score 0.8786, hard score 
0.8219 c411_faqkv_q60: habit score 0.8786, hard score 0.8219 c412_faqkv_q60r4: habit score 0.8786, hard score 0.8219 c288_latectl_iq6k: habit score 0.8786, hard score 0.8219 c417_faqkv_q6kr4: habit score 0.8786, hard score 0.8219 c442_ryssig_q8r8: habit score 0.8786, hard score 0.8219 c308_outtok_iq6k: habit score 0.8786, hard score 0.8219 c35_bndall_q6k: habit score 0.8786, hard score 0.8219 c238_bndall_iq6k: habit score 0.8786, hard score 0.8219 c427_ryspath_q8kvr8: habit score 0.8786, hard score 0.8219 c461_latectl_q8kvr8: habit score 0.8786, hard score 0.8219 c460_latectl_q8kr8: habit score 0.8786, hard score 0.8219 c688_fa_iq4kt_rysffn_q6k: habit score 0.8786, hard score 0.8219 c599_fa_iq5ks_finalffn_q8: habit score 0.8786, hard score 0.8219 c409_faqkv_q8kr8: habit score 0.8786, hard score 0.8219 c408_faqkv_q8r8: habit score 0.8786, hard score 0.8219 c03_habit_boundary_q8: habit score 0.8786, hard score 0.8219 c45_lateattn_bf16: habit score 0.8786, hard score 0.8219 c478_outtok_q8kvr8: habit score 0.8786, hard score 0.8219 c580_fa_iq5ks_rysffn_q6k: habit score 0.8786, hard score 0.8219 c477_outtok_q8kr8: habit score 0.8786, hard score 0.8219 c702_fa_iq4kt_lateffn_bf16: habit score 0.8786, hard score 0.8219 c476_outtok_q8r8: habit score 0.8786, hard score 0.8219 c689_fa_iq4kt_rysffn_q8: habit score 0.8786, hard score 0.8219 c89_fa_q8_ryssig_q8: habit score 0.8786, hard score 0.8219 c672_fa_iq5k_finalffn_bf16: habit score 0.8786, hard score 0.8219 c114_fao_bf16_faqk_q8: habit score 0.8786, hard score 0.8219 c15_faqk_bf16: habit score 0.8786, hard score 0.8219 c87_fa_q8_outtok_q8: habit score 0.8786, hard score 0.8219 c101_fa_bf16_ryssig_q8: habit score 0.8786, hard score 0.8219 c534_fa_iq4kss_outtok_bf16: habit score 0.8786, hard score 0.8219 c105_fa_bf16_bndffn_q8: habit score 0.8786, hard score 0.8219 c99_fa_bf16_outtok_q8: habit score 0.8786, hard score 0.8219 c102_fa_bf16_ryssig_bf16: habit score 0.8786, hard score 0.8219 c319_out_iq4xs: habit score 0.9143, 
hard score 0.7752 c644_fa_iq5k_ryssig_iq4kss: habit score 0.9143, hard score 0.7752 c142_fa_q8_outtok_q8_lateffn_q8: habit score 0.9143, hard score 0.7752 c148_fa_bf16_outtok_q8_lateffn_q8: habit score 0.9143, hard score 0.7752 c211_fao_iq3kt: habit score 0.8929, hard score 0.7996 c466_latectl_q4kr4: habit score 0.8786, hard score 0.8174 c298_finalffn_iq6k: habit score 0.8786, hard score 0.8174 c722_fa_iq3kt_rysffn_iq4kss: habit score 0.8929, hard score 0.7935 c262_rysffn_mxfp4: habit score 0.8786, hard score 0.8061 c309_outtok_iq4xs: habit score 0.8786, hard score 0.8041 c482_outtok_q40r8: habit score 0.8786, hard score 0.8041 c424_ryspath_iq3kr4: habit score 0.8786, hard score 0.8041 c302_finalffn_mxfp4: habit score 0.8786, hard score 0.8041 c282_lateffn_mxfp4: habit score 0.8786, hard score 0.8041 c232_ryssig_mxfp4: habit score 0.8786, hard score 0.8041 c381_fa_q4kr4: habit score 0.8786, hard score 0.8041 c195_fav_iq5ks: habit score 0.8786, hard score 0.8041 c419_ryspath_iq5ksr4: habit score 0.8786, hard score 0.8041 c321_out_iq3kt: habit score 0.8786, hard score 0.8035 c346_srcsig_iq4kt: habit score 0.8786, hard score 0.8035 c295_finalffn_iq5ks: habit score 0.8786, hard score 0.8035 c287_latectl_iq5k: habit score 0.8786, hard score 0.8019 c591_fa_iq5ks_lateffn_iq5ks: habit score 0.8786, hard score 0.8019 c393_faqk_q8kvr8: habit score 0.8786, hard score 0.8019 c475_outtok_iq3kr4: habit score 0.8786, hard score 0.7996 c529_fa_iq4kss_outtok_iq4ks: habit score 0.8786, hard score 0.7996 c473_outtok_iq4xsr8: habit score 0.8786, hard score 0.7996 c303_outtok_iq4ks: habit score 0.8786, hard score 0.7996 c313_out_iq4ks: habit score 0.8786, hard score 0.7996 c316_out_iq4k: habit score 0.8786, hard score 0.7996 c483_outtok_q4kr4: habit score 0.8786, hard score 0.7996 c741_fa_iq3kt_finalffn_iq5ks: habit score 0.8786, hard score 0.7996 c331_tok_iq3kt: habit score 0.8786, hard score 0.7996 c291_latectl_iq3kt: habit score 0.8786, hard score 0.7996 c181_faqk_iq3kt: habit score 
0.8786, hard score 0.7996 c554_fa_iq4kss_lateffn_iq4kss: habit score 0.8786, hard score 0.7996 c698_fa_iq4kt_lateffn_iq4kss: habit score 0.8786, hard score 0.7996 c680_fa_iq4kt_ryssig_iq4kss: habit score 0.8786, hard score 0.7996 c260_rysffn_iq4kt: habit score 0.8786, hard score 0.7996 c559_fa_iq4kss_finalffn_iq4ks: habit score 0.8786, hard score 0.7996 c301_finalffn_iq3kt: habit score 0.8786, hard score 0.7996 c511_fa_iq4ks_bndffn_iq4ks: habit score 0.8786, hard score 0.7996 c365_duppath_iq3kt: habit score 0.8786, hard score 0.7996 c305_outtok_iq5ks: habit score 0.8786, hard score 0.7996 c284_latectl_iq4kss: habit score 0.8786, hard score 0.7996 c324_tok_iq4kss: habit score 0.8786, hard score 0.7996 c341_earlyattn_iq3kt: habit score 0.8786, hard score 0.7996 c367_fa_iq4ksr4: habit score 0.8786, hard score 0.7996 c347_srcsig_iq3kt: habit score 0.8786, hard score 0.7996 c730_fa_iq3kt_bndffn_q6k: habit score 0.8786, hard score 0.7996 c405_faqkv_iq4xsr8: habit score 0.8786, hard score 0.7996 c401_faqkv_iq4ksr4: habit score 0.8786, hard score 0.7996 c452_latectl_iq4ksr4: habit score 0.8786, hard score 0.7996 c179_faqk_iq4xs: habit score 0.8786, hard score 0.7996 c362_duppath_iq4kss: habit score 0.8786, hard score 0.7996 c249_bndffn_iq4xs: habit score 0.8786, hard score 0.7996 c549_fa_iq4kss_bndffn_iq5ks: habit score 0.8786, hard score 0.7996 c439_ryssig_iq4xsr8: habit score 0.8786, hard score 0.7996 c344_srcsig_iq4kss: habit score 0.8786, hard score 0.7996 c352_dupsig_iq4kt: habit score 0.8786, hard score 0.7996 c223_ryssig_iq4ks: habit score 0.8786, hard score 0.7996 c272_lateattn_mxfp4: habit score 0.8786, hard score 0.7996 c263_lateattn_iq4ks: habit score 0.8786, hard score 0.7996 c360_srcpath_mxfp4: habit score 0.8786, hard score 0.7996 c355_srcpath_iq4ks: habit score 0.8786, hard score 0.7996 c361_duppath_iq4ks: habit score 0.8786, hard score 0.7996 c212_fao_mxfp4: habit score 0.8786, hard score 0.7996 c342_earlyattn_mxfp4: habit score 0.8786, hard score 0.7996 
c199_fav_iq4xs: habit score 0.8786, hard score 0.7996 c372_fa_iq4nlr4: habit score 0.8786, hard score 0.7996 c380_fa_q40r8: habit score 0.8786, hard score 0.7996 c403_faqkv_iq4kr4: habit score 0.8786, hard score 0.7996 c343_srcsig_iq4ks: habit score 0.8786, hard score 0.7996 c266_lateattn_iq4k: habit score 0.8786, hard score 0.7996 c286_latectl_iq4k: habit score 0.8786, hard score 0.7996 c420_ryspath_iq4kr4: habit score 0.8786, hard score 0.7996 c246_bndffn_iq4k: habit score 0.8786, hard score 0.7996 c276_lateffn_iq4k: habit score 0.8786, hard score 0.7996 c296_finalffn_iq4k: habit score 0.8786, hard score 0.7996 c326_tok_iq4k: habit score 0.8786, hard score 0.7996 c389_faqk_iq4nlr4: habit score 0.8786, hard score 0.7996 c437_ryssig_iq4kr4: habit score 0.8786, hard score 0.7996 c489_fav_q8kv: habit score 0.8786, hard score 0.7996 c351_dupsig_iq5ks: habit score 0.8786, hard score 0.7996 c265_lateattn_iq5ks: habit score 0.8786, hard score 0.7996 c337_earlyattn_iq5k: habit score 0.8786, hard score 0.7996 c363_duppath_iq5ks: habit score 0.8786, hard score 0.7996 c275_lateffn_iq5ks: habit score 0.8786, hard score 0.7996 c227_ryssig_iq5k: habit score 0.8786, hard score 0.7996 c438_ryssig_iq5kr4: habit score 0.8786, hard score 0.7996 c584_fa_iq5ks_bndffn_iq4kss: habit score 0.8786, hard score 0.7996 c215_ryspath_iq5ks: habit score 0.8786, hard score 0.7996 c285_latectl_iq5ks: habit score 0.8786, hard score 0.7996 c325_tok_iq5ks: habit score 0.8786, hard score 0.7996 c247_bndffn_iq5k: habit score 0.8786, hard score 0.7996 c268_lateattn_iq6k: habit score 0.8786, hard score 0.7996 c177_faqk_iq5k: habit score 0.8786, hard score 0.7996 c396_faqk_q50r4: habit score 0.8786, hard score 0.7996 c416_faqkv_q5kr4: habit score 0.8786, hard score 0.7996 c445_ryssig_q60: habit score 0.8786, hard score 0.7996 c446_ryssig_q60r4: habit score 0.8786, hard score 0.7996 c451_ryssig_q6kr4: habit score 0.8786, hard score 0.7996 c394_faqk_q60: habit score 0.8786, hard score 0.7996 
c479_outtok_q60: habit score 0.8786, hard score 0.7996 c485_outtok_q6kr4: habit score 0.8786, hard score 0.7996 c328_tok_iq6k: habit score 0.8786, hard score 0.7996 c188_faqkv_iq6k: habit score 0.8786, hard score 0.7996 c426_ryspath_q8kr8: habit score 0.8786, hard score 0.7996 c575_fa_iq5ks_ryssig_q8: habit score 0.8786, hard score 0.7996 c392_faqk_q8kr8: habit score 0.8786, hard score 0.7996 c459_latectl_q8r8: habit score 0.8786, hard score 0.7996 c738_fa_iq3kt_lateffn_bf16: habit score 0.8786, hard score 0.7996 c63_fa_q5k_outtok_q8: habit score 0.8786, hard score 0.7996 c09_union_rys_path_q8_late_control_q8: habit score 0.8786, hard score 0.7996 c714_fa_iq3kt_outtok_bf16: habit score 0.8786, hard score 0.7996 c690_fa_iq4kt_rysffn_bf16: habit score 0.8786, hard score 0.7996 c582_fa_iq5ks_rysffn_bf16: habit score 0.8786, hard score 0.7996 c757_fa_mxfp4_rysffn_iq4ks: habit score 0.8786, hard score 0.798 c310_outtok_iq4kt: habit score 0.8786, hard score 0.7974 c359_srcpath_iq3kt: habit score 0.8786, hard score 0.7974 c441_ryssig_iq3kr4: habit score 0.8786, hard score 0.7974 c256_rysffn_iq4k: habit score 0.8786, hard score 0.7974 c235_bndall_iq5ks: habit score 0.8786, hard score 0.7974 c589_fa_iq5ks_lateffn_iq4ks: habit score 0.8786, hard score 0.7974 c661_fa_iq5k_lateffn_iq4ks: habit score 0.8786, hard score 0.7974 c258_rysffn_iq6k: habit score 0.8786, hard score 0.7974 c283_latectl_iq4ks: habit score 0.8786, hard score 0.7952 c356_srcpath_iq4kss: habit score 0.8786, hard score 0.7952 c414_faqkv_q40r8: habit score 0.8786, hard score 0.7952 c468_latectl_q6kr4: habit score 0.8786, hard score 0.7952 c112_fav_bf16_fao_bf16: habit score 0.8786, hard score 0.7952 c264_lateattn_iq4kss: habit score 0.8786, hard score 0.7935 c69_fa_q5k_bndffn_q8: habit score 0.8786, hard score 0.7915 c430_ryspath_q50r4: habit score 0.8786, hard score 0.7907 c320_out_iq4kt: habit score 0.8786, hard score 0.7874 c425_ryspath_q8r8: habit score 0.8786, hard score 0.7841 c588_fa_iq5ks_bndffn_bf16: 
habit score 0.9143, hard score 0.7352 c96_fa_q8_lateffn_bf16: habit score 0.9143, hard score 0.7352 c255_rysffn_iq5ks: habit score 0.8786, hard score 0.7813 c455_latectl_iq5kr4: habit score 0.8786, hard score 0.7796 c390_faqk_iq3kr4: habit score 0.8786, hard score 0.7774 c447_ryssig_q50r4: habit score 0.8786, hard score 0.7769 c469_outtok_iq4ksr4: habit score 0.8429, hard score 0.8219 c294_finalffn_iq4kss: habit score 0.8429, hard score 0.8219 c292_latectl_mxfp4: habit score 0.8429, hard score 0.8219 c194_fav_iq4kss: habit score 0.8429, hard score 0.8219 c406_faqkv_iq4nlr4: habit score 0.8429, hard score 0.8219 c707_fa_iq4kt_finalffn_q8: habit score 0.8429, hard score 0.8219 c462_latectl_q60: habit score 0.8429, hard score 0.8219 c240_bndall_iq4kt: habit score 0.8786, hard score 0.7752 c432_ryspath_q4kr4: habit score 0.8786, hard score 0.7752 c448_ryssig_q40r8: habit score 0.8786, hard score 0.7752 c399_faqk_q5kr4: habit score 0.8786, hard score 0.7752 c490_ryspath_q8kv: habit score 0.8786, hard score 0.7752 c410_faqkv_q8kvr8: habit score 0.8786, hard score 0.7752 c126_bndall_bf16_lateffn_q8: habit score 0.8786, hard score 0.7752 c257_rysffn_iq5k: habit score 0.8614, hard score 0.7974 c710_fa_iq3kt_outtok_iq4kss: habit score 0.8571, hard score 0.7996 c400_faqk_q6kr4: habit score 0.8786, hard score 0.7663 c261_rysffn_iq3kt: habit score 0.85, hard score 0.7996 c388_faqk_iq4xsr8: habit score 0.85, hard score 0.7996 c202_fav_mxfp4: habit score 0.8429, hard score 0.8078 c83_fa_q6k_lateffn_q8: habit score 0.9143, hard score 0.713 c293_finalffn_iq4ks: habit score 0.8429, hard score 0.7996 c694_fa_iq4kt_bndffn_q6k: habit score 0.8429, hard score 0.7996 c745_fa_mxfp4_outtok_iq4ks: habit score 0.8429, hard score 0.798 c517_fa_iq4ks_lateffn_iq4ks: habit score 0.8786, hard score 0.7352 c163_fa_iq4ks: habit score 0.8786, hard score 0.7352 c186_faqkv_iq4k: habit score 0.8786, hard score 0.7352 c454_latectl_iq4kr4: habit score 0.8786, hard score 0.7352 c457_latectl_iq4nlr4: habit 
score 0.8786, hard score 0.7352 c297_finalffn_iq5k: habit score 0.8786, hard score 0.7352 c175_faqk_iq5ks: habit score 0.8786, hard score 0.7352 c217_ryspath_iq5k: habit score 0.8786, hard score 0.7352 c585_fa_iq5ks_bndffn_iq5ks: habit score 0.8786, hard score 0.7352 c586_fa_iq5ks_bndffn_q6k: habit score 0.8786, hard score 0.7352 c168_fa_iq6k: habit score 0.8786, hard score 0.7352 c684_fa_iq4kt_ryssig_bf16: habit score 0.8786, hard score 0.7352 c374_fa_q8r8: habit score 0.8786, hard score 0.7352 c522_fa_iq4ks_lateffn_bf16: habit score 0.8786, hard score 0.7352 c594_fa_iq5ks_lateffn_bf16: habit score 0.8786, hard score 0.7352 c371_fa_iq4xsr8: habit score 0.8429, hard score 0.7752 c245_bndffn_iq5ks: habit score 0.8429, hard score 0.7752 c165_fa_iq5ks: habit score 0.8786, hard score 0.7174 c304_outtok_iq4kss: habit score 0.8786, hard score 0.713 c415_faqkv_q4kr4: habit score 0.8786, hard score 0.713 c357_srcpath_iq5ks: habit score 0.8786, hard score 0.713 c185_faqkv_iq5ks: habit score 0.8786, hard score 0.713 c402_faqkv_iq5ksr4: habit score 0.8786, hard score 0.713 c726_fa_iq3kt_rysffn_bf16: habit score 0.8786, hard score 0.713 c314_out_iq4kss: habit score 0.8429, hard score 0.7352 c458_latectl_iq3kr4: habit score 0.8786, hard score 0.6885 c487_faqk_q8kv: habit score 0.8786, hard score 0.6885 c222_ryspath_mxfp4: habit score 0.8571, hard score 0.693 c376_fa_q8kvr8: habit score 0.8786, hard score 0.6863 c378_fa_q60r4: habit score 0.9143, hard score 0.5996 c383_fa_q6kr4: habit score 0.9143, hard score 0.5996 c486_fa_q8kv: habit score 0.7357, hard score 0.6078 c375_fa_q8kr8: habit score 0.7357, hard score 0.5996 c687_fa_iq4kt_rysffn_iq5ks: habit score 0.6286, hard score 0.51 c488_faqkv_q8kv: habit score 0, hard score 0.3774 c581_fa_iq5ks_rysffn_q8: habit score 0.1286, hard score 0.4167
Axes: habit score, hard score
Comprehensive score vs file size

c373_fa_iq3kr4: size GB 16.29, weighted score 0.8935 c281_lateffn_iq3kt: size GB 16.43, weighted score 0.8916 c718_fa_iq3kt_ryssig_q6k: size GB 16.4, weighted score 0.8822 c204_fao_iq4kss: size GB 16.52, weighted score 0.8761 c311_outtok_iq3kt: size GB 15.79, weighted score 0.8741 c317_out_iq5k: size GB 16.39, weighted score 0.8741 c254_rysffn_iq4kss: size GB 16.39, weighted score 0.8741 c500_fa_iq4ks_ryssig_iq4kss: size GB 16.44, weighted score 0.8741 c234_bndall_iq4kss: size GB 16.46, weighted score 0.8741 c253_rysffn_iq4ks: size GB 16.47, weighted score 0.8741 c169_fa_iq4xs: size GB 16.48, weighted score 0.8741 c274_lateffn_iq4kss: size GB 16.51, weighted score 0.8741 c456_latectl_iq4xsr8: size GB 16.51, weighted score 0.8741 c213_ryspath_iq4ks: size GB 16.52, weighted score 0.8741 c364_duppath_iq4kt: size GB 16.52, weighted score 0.8741 c201_fav_iq3kt: size GB 16.53, weighted score 0.8741 c273_lateffn_iq4ks: size GB 16.53, weighted score 0.8741 c200_fav_iq4kt: size GB 16.54, weighted score 0.8741 c339_earlyattn_iq4xs: size GB 16.54, weighted score 0.8741 c193_fav_iq4ks: size GB 16.54, weighted score 0.8741 c369_fa_iq4kr4: size GB 16.54, weighted score 0.8741 c336_earlyattn_iq4k: size GB 16.55, weighted score 0.8741 c423_ryspath_iq4nlr4: size GB 16.55, weighted score 0.8741 c431_ryspath_q40r8: size GB 16.55, weighted score 0.8741 c206_fao_iq4k: size GB 16.55, weighted score 0.8741 c226_ryssig_iq4k: size GB 16.55, weighted score 0.8741 c397_faqk_q40r8: size GB 16.55, weighted score 0.8741 c205_fao_iq5ks: size GB 16.61, weighted score 0.8741 c433_ryspath_q5kr4: size GB 16.69, weighted score 0.8741 c649_fa_iq5k_rysffn_iq4ks: size GB 16.7, weighted score 0.8741 c21_fao_q6k: size GB 16.7, weighted score 0.8741 c208_fao_iq6k: size GB 16.71, weighted score 0.8741 c404_faqkv_iq5kr4: size GB 16.71, weighted score 0.8741 c610_fa_iq4k_ryssig_q6k: size GB 16.72, weighted score 0.8741 c655_fa_iq5k_bndffn_iq4ks: size GB 16.75, weighted score 
0.8741 c167_fa_iq5k: size GB 16.78, weighted score 0.8741 c370_fa_iq5kr4: size GB 16.78, weighted score 0.8741 c379_fa_q50r4: size GB 16.78, weighted score 0.8741 c22_fao_q8: size GB 16.84, weighted score 0.8741 c391_faqk_q8r8: size GB 17.17, weighted score 0.8741 c616_fa_iq4k_rysffn_q6k: size GB 17.23, weighted score 0.8741 c587_fa_iq5ks_bndffn_q8: size GB 17.26, weighted score 0.8741 c77_fa_q6k_ryssig_q8: size GB 17.38, weighted score 0.8741 c666_fa_iq5k_lateffn_bf16: size GB 17.8, weighted score 0.8741 c93_fa_q8_bndffn_q8: size GB 18.02, weighted score 0.8741 c154_fao_bf16_faqk_q8_lateffn_q8: size GB 18.34, weighted score 0.8741 c06_output_token_q8_fullattn_q8: size GB 18.43, weighted score 0.8741 c05_all_fullattn_bf16: size GB 19.26, weighted score 0.8741 c56_outtok_bf16: size GB 19.88, weighted score 0.8741 c145_fa_bf16_outtok_q8_ryssig_q8: size GB 20.55, weighted score 0.8741 c147_fa_bf16_outtok_q8_bndffn_q8: size GB 20.73, weighted score 0.8741 c106_fa_bf16_bndffn_bf16: size GB 20.79, weighted score 0.8741 c88_fa_q8_outtok_bf16: size GB 20.81, weighted score 0.8741 c100_fa_bf16_outtok_bf16: size GB 22.58, weighted score 0.8741 c172_fa_mxfp4: size GB 16.48, weighted score 0.8734 c192_faqkv_mxfp4: size GB 16.5, weighted score 0.8734 c239_bndall_iq4xs: size GB 16.51, weighted score 0.8722 c377_fa_q60: size GB 17.01, weighted score 0.8722 c413_faqkv_q50r4: size GB 16.71, weighted score 0.8715 c207_fao_iq5k: size GB 16.63, weighted score 0.8683 c312_outtok_mxfp4: size GB 16.15, weighted score 0.8678 c300_finalffn_iq4kt: size GB 16.51, weighted score 0.8664 c474_outtok_iq4nlr4: size GB 16.23, weighted score 0.8645 c691_fa_iq4kt_bndffn_iq4ks: size GB 16.39, weighted score 0.8645 c184_faqkv_iq4kss: size GB 16.46, weighted score 0.8645 c330_tok_iq4kt: size GB 16.48, weighted score 0.8645 c174_faqk_iq4kss: size GB 16.48, weighted score 0.8645 c244_bndffn_iq4kss: size GB 16.49, weighted score 0.8645 c329_tok_iq4xs: size GB 16.52, weighted score 0.8645 
c219_ryspath_iq4xs: size GB 16.52, weighted score 0.8645 c210_fao_iq4kt: size GB 16.52, weighted score 0.8645 c252_bndffn_mxfp4: size GB 16.52, weighted score 0.8645 c279_lateffn_iq4xs: size GB 16.53, weighted score 0.8645 c209_fao_iq4xs: size GB 16.54, weighted score 0.8645 c203_fao_iq4ks: size GB 16.54, weighted score 0.8645 c166_fa_iq4k: size GB 16.54, weighted score 0.8645 c472_outtok_iq5kr4: size GB 16.54, weighted score 0.8645 c267_lateattn_iq5k: size GB 16.62, weighted score 0.8645 c583_fa_iq5ks_bndffn_iq4ks: size GB 16.69, weighted score 0.8645 c639_fa_iq5k_outtok_iq5ks: size GB 16.69, weighted score 0.8645 c464_latectl_q50r4: size GB 16.71, weighted score 0.8645 c467_latectl_q5kr4: size GB 16.71, weighted score 0.8645 c382_fa_q5kr4: size GB 16.78, weighted score 0.8645 c491_ryssig_q8kv: size GB 16.86, weighted score 0.8645 c604_fa_iq4k_outtok_q6k: size GB 16.87, weighted score 0.8645 c178_faqk_iq6k: size GB 16.88, weighted score 0.8645 c111_fav_bf16_fao_q8: size GB 16.96, weighted score 0.8645 c544_fa_iq4kss_rysffn_q6k: size GB 17.12, weighted score 0.8645 c23_fao_bf16: size GB 17.37, weighted score 0.8645 c564_fa_iq4kss_finalffn_bf16: size GB 17.58, weighted score 0.8645 c569_fa_iq5ks_outtok_q8: size GB 17.67, weighted score 0.8645 c152_fav_bf16_fao_bf16_bndffn_q8: size GB 18.03, weighted score 0.8645 c151_fav_bf16_fao_bf16_outtok_q8: size GB 18.44, weighted score 0.8645 c141_fa_q8_outtok_q8_bndffn_q8: size GB 18.97, weighted score 0.8645 c64_fa_q5k_outtok_bf16: size GB 20.11, weighted score 0.8645 c76_fa_q6k_outtok_bf16: size GB 20.36, weighted score 0.8645 c751_fa_mxfp4_ryssig_iq4ks: size GB 16.46, weighted score 0.8638 c592_fa_iq5ks_lateffn_q6k: size GB 16.91, weighted score 0.8635 c322_out_mxfp4: size GB 16.19, weighted score 0.8581 c407_faqkv_iq3kr4: size GB 16.37, weighted score 0.8575 c214_ryspath_iq4kss: size GB 16.48, weighted score 0.8575 c190_faqkv_iq4kt: size GB 16.46, weighted score 0.8559 c242_bndall_mxfp4: size GB 16.51, weighted score 
0.8559 c348_srcsig_mxfp4: size GB 16.54, weighted score 0.8559 c450_ryssig_q5kr4: size GB 16.64, weighted score 0.8559 c277_lateffn_iq5k: size GB 16.64, weighted score 0.8559 c590_fa_iq5ks_lateffn_iq4kss: size GB 16.68, weighted score 0.8559 c492_lateattn_q8kv: size GB 16.78, weighted score 0.8559 c429_ryspath_q60r4: size GB 16.83, weighted score 0.8559 c494_fa_iq4ks_outtok_iq4kss: size GB 16, weighted score 0.8539 c734_fa_iq3kt_lateffn_iq4kss: size GB 16.18, weighted score 0.8539 c171_fa_iq3kt: size GB 16.22, weighted score 0.8539 c306_outtok_iq4k: size GB 16.23, weighted score 0.8539 c471_outtok_iq4kr4: size GB 16.23, weighted score 0.8539 c241_bndall_iq3kt: size GB 16.3, weighted score 0.8539 c191_faqkv_iq3kt: size GB 16.32, weighted score 0.8539 c315_out_iq5ks: size GB 16.35, weighted score 0.8539 c692_fa_iq4kt_bndffn_iq4kss: size GB 16.36, weighted score 0.8539 c221_ryspath_iq3kt: size GB 16.36, weighted score 0.8539 c251_bndffn_iq3kt: size GB 16.37, weighted score 0.8539 c505_fa_iq4ks_rysffn_iq4ks: size GB 16.4, weighted score 0.8539 c164_fa_iq4kss: size GB 16.43, weighted score 0.8539 c170_fa_iq4kt: size GB 16.43, weighted score 0.8539 c524_fa_iq4ks_finalffn_iq4kss: size GB 16.44, weighted score 0.8539 c231_ryssig_iq3kt: size GB 16.44, weighted score 0.8539 c271_lateattn_iq3kt: size GB 16.46, weighted score 0.8539 c470_outtok_iq5ksr4: size GB 16.47, weighted score 0.8539 c259_rysffn_iq4xs: size GB 16.47, weighted score 0.8539 c290_latectl_iq4kt: size GB 16.47, weighted score 0.8539 c180_faqk_iq4kt: size GB 16.48, weighted score 0.8539 c220_ryspath_iq4kt: size GB 16.48, weighted score 0.8539 c250_bndffn_iq4kt: size GB 16.49, weighted score 0.8539 c353_dupsig_iq3kt: size GB 16.5, weighted score 0.8539 c189_faqkv_iq4xs: size GB 16.5, weighted score 0.8539 c183_faqkv_iq4ks: size GB 16.5, weighted score 0.8539 c233_bndall_iq4ks: size GB 16.51, weighted score 0.8539 c280_lateffn_iq4kt: size GB 16.51, weighted score 0.8539 c224_ryssig_iq4kss: size GB 16.51, 
weighted score 0.8539 c230_ryssig_iq4kt: size GB 16.51, weighted score 0.8539 c289_latectl_iq4xs: size GB 16.51, weighted score 0.8539 c332_tok_mxfp4: size GB 16.52, weighted score 0.8539 c323_tok_iq4ks: size GB 16.52, weighted score 0.8539 c182_faqk_mxfp4: size GB 16.52, weighted score 0.8539 c422_ryspath_iq4xsr8: size GB 16.52, weighted score 0.8539 c173_faqk_iq4ks: size GB 16.52, weighted score 0.8539 c384_faqk_iq4ksr4: size GB 16.52, weighted score 0.8539 c418_ryspath_iq4ksr4: size GB 16.52, weighted score 0.8539 c358_srcpath_iq4kt: size GB 16.52, weighted score 0.8539 c270_lateattn_iq4kt: size GB 16.52, weighted score 0.8539 c243_bndffn_iq4ks: size GB 16.52, weighted score 0.8539 c334_earlyattn_iq4kss: size GB 16.53, weighted score 0.8539 c340_earlyattn_iq4kt: size GB 16.53, weighted score 0.8539 c299_finalffn_iq4xs: size GB 16.53, weighted score 0.8539 c229_ryssig_iq4xs: size GB 16.53, weighted score 0.8539 c350_dupsig_iq4kss: size GB 16.53, weighted score 0.8539 c435_ryssig_iq4ksr4: size GB 16.53, weighted score 0.8539 c269_lateattn_iq4xs: size GB 16.54, weighted score 0.8539 c366_duppath_mxfp4: size GB 16.54, weighted score 0.8539 c333_earlyattn_iq4ks: size GB 16.54, weighted score 0.8539 c196_fav_iq4k: size GB 16.54, weighted score 0.8539 c354_dupsig_mxfp4: size GB 16.54, weighted score 0.8539 c349_dupsig_iq4ks: size GB 16.54, weighted score 0.8539 c307_outtok_iq5k: size GB 16.54, weighted score 0.8539 c481_outtok_q50r4: size GB 16.54, weighted score 0.8539 c484_outtok_q5kr4: size GB 16.54, weighted score 0.8539 c465_latectl_q40r8: size GB 16.55, weighted score 0.8539 c216_ryspath_iq4k: size GB 16.55, weighted score 0.8539 c236_bndall_iq4k: size GB 16.55, weighted score 0.8539 c176_faqk_iq4k: size GB 16.55, weighted score 0.8539 c197_fav_iq5k: size GB 16.55, weighted score 0.8539 c386_faqk_iq4kr4: size GB 16.55, weighted score 0.8539 c398_faqk_q4kr4: size GB 16.55, weighted score 0.8539 c440_ryssig_iq4nlr4: size GB 16.55, weighted score 0.8539 
c449_ryssig_q4kr4: size GB 16.55, weighted score 0.8539 c318_out_iq6k: size GB 16.56, weighted score 0.8539 c198_fav_iq6k: size GB 16.57, weighted score 0.8539 c345_srcsig_iq5ks: size GB 16.59, weighted score 0.8539 c335_earlyattn_iq5ks: size GB 16.59, weighted score 0.8539 c627_fa_iq4k_lateffn_iq5ks: size GB 16.61, weighted score 0.8539 c633_fa_iq4k_finalffn_iq5ks: size GB 16.62, weighted score 0.8539 c225_ryssig_iq5ks: size GB 16.62, weighted score 0.8539 c436_ryssig_iq5ksr4: size GB 16.62, weighted score 0.8539 c621_fa_iq4k_bndffn_iq5ks: size GB 16.64, weighted score 0.8539 c338_earlyattn_iq6k: size GB 16.66, weighted score 0.8539 c453_latectl_iq5ksr4: size GB 16.67, weighted score 0.8539 c385_faqk_iq5ksr4: size GB 16.67, weighted score 0.8539 c421_ryspath_iq5kr4: size GB 16.69, weighted score 0.8539 c187_faqkv_iq5k: size GB 16.71, weighted score 0.8539 c387_faqk_iq5kr4: size GB 16.71, weighted score 0.8539 c327_tok_iq5k: size GB 16.71, weighted score 0.8539 c368_fa_iq5ksr4: size GB 16.72, weighted score 0.8539 c237_bndall_iq5k: size GB 16.74, weighted score 0.8539 c228_ryssig_iq6k: size GB 16.74, weighted score 0.8539 c278_lateffn_iq6k: size GB 16.74, weighted score 0.8539 c676_fa_iq4kt_outtok_q6k: size GB 16.75, weighted score 0.8539 c539_fa_iq4kss_ryssig_q8: size GB 16.77, weighted score 0.8539 c428_ryspath_q60: size GB 16.83, weighted score 0.8539 c248_bndffn_iq6k: size GB 16.84, weighted score 0.8539 c434_ryspath_q6kr4: size GB 16.84, weighted score 0.8539 c218_ryspath_iq6k: size GB 16.85, weighted score 0.8539 c444_ryssig_q8kvr8: size GB 16.86, weighted score 0.8539 c463_latectl_q60r4: size GB 16.86, weighted score 0.8539 c395_faqk_q60r4: size GB 16.86, weighted score 0.8539 c480_outtok_q60r4: size GB 16.86, weighted score 0.8539 c443_ryssig_q8kr8: size GB 16.86, weighted score 0.8539 c411_faqkv_q60: size GB 16.87, weighted score 0.8539 c412_faqkv_q60r4: size GB 16.87, weighted score 0.8539 c288_latectl_iq6k: size GB 16.88, weighted score 0.8539 
c417_faqkv_q6kr4: size GB 16.88, weighted score 0.8539 c442_ryssig_q8r8: size GB 16.9, weighted score 0.8539 c308_outtok_iq6k: size GB 16.9, weighted score 0.8539 c35_bndall_q6k: size GB 16.94, weighted score 0.8539 c238_bndall_iq6k: size GB 16.95, weighted score 0.8539 c427_ryspath_q8kvr8: size GB 17.04, weighted score 0.8539 c461_latectl_q8kvr8: size GB 17.09, weighted score 0.8539 c460_latectl_q8kr8: size GB 17.1, weighted score 0.8539 c688_fa_iq4kt_rysffn_q6k: size GB 17.12, weighted score 0.8539 c599_fa_iq5ks_finalffn_q8: size GB 17.12, weighted score 0.8539 c409_faqkv_q8kr8: size GB 17.13, weighted score 0.8539 c408_faqkv_q8r8: size GB 17.2, weighted score 0.8539 c03_habit_boundary_q8: size GB 17.3, weighted score 0.8539 c45_lateattn_bf16: size GB 17.31, weighted score 0.8539 c478_outtok_q8kvr8: size GB 17.34, weighted score 0.8539 c580_fa_iq5ks_rysffn_q6k: size GB 17.41, weighted score 0.8539 c477_outtok_q8kr8: size GB 17.43, weighted score 0.8539 c702_fa_iq4kt_lateffn_bf16: size GB 17.45, weighted score 0.8539 c476_outtok_q8r8: size GB 17.5, weighted score 0.8539 c689_fa_iq4kt_rysffn_q8: size GB 17.76, weighted score 0.8539 c89_fa_q8_ryssig_q8: size GB 17.83, weighted score 0.8539 c672_fa_iq5k_finalffn_bf16: size GB 17.93, weighted score 0.8539 c114_fao_bf16_faqk_q8: size GB 17.98, weighted score 0.8539 c15_faqk_bf16: size GB 18.32, weighted score 0.8539 c87_fa_q8_outtok_q8: size GB 18.43, weighted score 0.8539 c101_fa_bf16_ryssig_q8: size GB 19.6, weighted score 0.8539 c534_fa_iq4kss_outtok_bf16: size GB 19.75, weighted score 0.8539 c105_fa_bf16_bndffn_q8: size GB 19.79, weighted score 0.8539 c99_fa_bf16_outtok_q8: size GB 20.2, weighted score 0.8539 c102_fa_bf16_ryssig_bf16: size GB 20.26, weighted score 0.8539 c319_out_iq4xs: size GB 16.19, weighted score 0.8539 c644_fa_iq5k_ryssig_iq4kss: size GB 16.74, weighted score 0.8539 c142_fa_q8_outtok_q8_lateffn_q8: size GB 18.79, weighted score 0.8539 c148_fa_bf16_outtok_q8_lateffn_q8: size GB 20.56, weighted 
score 0.8539 c211_fao_iq3kt: size GB 16.46, weighted score 0.8524 c466_latectl_q4kr4: size GB 16.55, weighted score 0.852 c298_finalffn_iq6k: size GB 16.77, weighted score 0.852 c722_fa_iq3kt_rysffn_iq4kss: size GB 16.05, weighted score 0.8497 c262_rysffn_mxfp4: size GB 16.47, weighted score 0.8471 c309_outtok_iq4xs: size GB 16.15, weighted score 0.8462 c482_outtok_q40r8: size GB 16.23, weighted score 0.8462 c424_ryspath_iq3kr4: size GB 16.4, weighted score 0.8462 c302_finalffn_mxfp4: size GB 16.53, weighted score 0.8462 c282_lateffn_mxfp4: size GB 16.53, weighted score 0.8462 c232_ryssig_mxfp4: size GB 16.53, weighted score 0.8462 c381_fa_q4kr4: size GB 16.54, weighted score 0.8462 c195_fav_iq5ks: size GB 16.55, weighted score 0.8462 c419_ryspath_iq5ksr4: size GB 16.66, weighted score 0.8462 c321_out_iq3kt: size GB 16.01, weighted score 0.846 c346_srcsig_iq4kt: size GB 16.53, weighted score 0.846 c295_finalffn_iq5ks: size GB 16.63, weighted score 0.846 c287_latectl_iq5k: size GB 16.71, weighted score 0.8452 c591_fa_iq5ks_lateffn_iq5ks: size GB 16.79, weighted score 0.8452 c393_faqk_q8kvr8: size GB 17.09, weighted score 0.8452 c475_outtok_iq3kr4: size GB 15.89, weighted score 0.8443 c529_fa_iq4kss_outtok_iq4ks: size GB 16.02, weighted score 0.8443 c473_outtok_iq4xsr8: size GB 16.15, weighted score 0.8443 c303_outtok_iq4ks: size GB 16.15, weighted score 0.8443 c313_out_iq4ks: size GB 16.19, weighted score 0.8443 c316_out_iq4k: size GB 16.23, weighted score 0.8443 c483_outtok_q4kr4: size GB 16.23, weighted score 0.8443 c741_fa_iq3kt_finalffn_iq5ks: size GB 16.3, weighted score 0.8443 c331_tok_iq3kt: size GB 16.34, weighted score 0.8443 c291_latectl_iq3kt: size GB 16.34, weighted score 0.8443 c181_faqk_iq3kt: size GB 16.34, weighted score 0.8443 c554_fa_iq4kss_lateffn_iq4kss: size GB 16.38, weighted score 0.8443 c698_fa_iq4kt_lateffn_iq4kss: size GB 16.38, weighted score 0.8443 c680_fa_iq4kt_ryssig_iq4kss: size GB 16.38, weighted score 0.8443 c260_rysffn_iq4kt: size 
GB 16.39, weighted score 0.8443 c559_fa_iq4kss_finalffn_iq4ks: size GB 16.4, weighted score 0.8443 c301_finalffn_iq3kt: size GB 16.42, weighted score 0.8443 c511_fa_iq4ks_bndffn_iq4ks: size GB 16.45, weighted score 0.8443 c365_duppath_iq3kt: size GB 16.46, weighted score 0.8443 c305_outtok_iq5ks: size GB 16.47, weighted score 0.8443 c284_latectl_iq4kss: size GB 16.47, weighted score 0.8443 c324_tok_iq4kss: size GB 16.48, weighted score 0.8443 c341_earlyattn_iq3kt: size GB 16.48, weighted score 0.8443 c367_fa_iq4ksr4: size GB 16.49, weighted score 0.8443 c347_srcsig_iq3kt: size GB 16.5, weighted score 0.8443 c730_fa_iq3kt_bndffn_q6k: size GB 16.5, weighted score 0.8443 c405_faqkv_iq4xsr8: size GB 16.5, weighted score 0.8443 c401_faqkv_iq4ksr4: size GB 16.5, weighted score 0.8443 c452_latectl_iq4ksr4: size GB 16.51, weighted score 0.8443 c179_faqk_iq4xs: size GB 16.52, weighted score 0.8443 c362_duppath_iq4kss: size GB 16.52, weighted score 0.8443 c249_bndffn_iq4xs: size GB 16.52, weighted score 0.8443 c549_fa_iq4kss_bndffn_iq5ks: size GB 16.53, weighted score 0.8443 c439_ryssig_iq4xsr8: size GB 16.53, weighted score 0.8443 c344_srcsig_iq4kss: size GB 16.53, weighted score 0.8443 c352_dupsig_iq4kt: size GB 16.53, weighted score 0.8443 c223_ryssig_iq4ks: size GB 16.53, weighted score 0.8443 c272_lateattn_mxfp4: size GB 16.54, weighted score 0.8443 c263_lateattn_iq4ks: size GB 16.54, weighted score 0.8443 c360_srcpath_mxfp4: size GB 16.54, weighted score 0.8443 c355_srcpath_iq4ks: size GB 16.54, weighted score 0.8443 c361_duppath_iq4ks: size GB 16.54, weighted score 0.8443 c212_fao_mxfp4: size GB 16.54, weighted score 0.8443 c342_earlyattn_mxfp4: size GB 16.54, weighted score 0.8443 c199_fav_iq4xs: size GB 16.54, weighted score 0.8443 c372_fa_iq4nlr4: size GB 16.54, weighted score 0.8443 c380_fa_q40r8: size GB 16.54, weighted score 0.8443 c403_faqkv_iq4kr4: size GB 16.54, weighted score 0.8443 c343_srcsig_iq4ks: size GB 16.54, weighted score 0.8443 c266_lateattn_iq4k: 
size GB 16.55, weighted score 0.8443 c286_latectl_iq4k: size GB 16.55, weighted score 0.8443 c420_ryspath_iq4kr4: size GB 16.55, weighted score 0.8443 c246_bndffn_iq4k: size GB 16.55, weighted score 0.8443 c276_lateffn_iq4k: size GB 16.55, weighted score 0.8443 c296_finalffn_iq4k: size GB 16.55, weighted score 0.8443 c326_tok_iq4k: size GB 16.55, weighted score 0.8443 c389_faqk_iq4nlr4: size GB 16.55, weighted score 0.8443 c437_ryssig_iq4kr4: size GB 16.55, weighted score 0.8443 c489_fav_q8kv: size GB 16.58, weighted score 0.8443 c351_dupsig_iq5ks: size GB 16.59, weighted score 0.8443 c265_lateattn_iq5ks: size GB 16.6, weighted score 0.8443 c337_earlyattn_iq5k: size GB 16.6, weighted score 0.8443 c363_duppath_iq5ks: size GB 16.61, weighted score 0.8443 c275_lateffn_iq5ks: size GB 16.62, weighted score 0.8443 c227_ryssig_iq5k: size GB 16.64, weighted score 0.8443 c438_ryssig_iq5kr4: size GB 16.64, weighted score 0.8443 c584_fa_iq5ks_bndffn_iq4kss: size GB 16.66, weighted score 0.8443 c215_ryspath_iq5ks: size GB 16.66, weighted score 0.8443 c285_latectl_iq5ks: size GB 16.67, weighted score 0.8443 c325_tok_iq5ks: size GB 16.68, weighted score 0.8443 c247_bndffn_iq5k: size GB 16.69, weighted score 0.8443 c268_lateattn_iq6k: size GB 16.69, weighted score 0.8443 c177_faqk_iq5k: size GB 16.71, weighted score 0.8443 c396_faqk_q50r4: size GB 16.71, weighted score 0.8443 c416_faqkv_q5kr4: size GB 16.71, weighted score 0.8443 c445_ryssig_q60: size GB 16.73, weighted score 0.8443 c446_ryssig_q60r4: size GB 16.73, weighted score 0.8443 c451_ryssig_q6kr4: size GB 16.73, weighted score 0.8443 c394_faqk_q60: size GB 16.86, weighted score 0.8443 c479_outtok_q60: size GB 16.86, weighted score 0.8443 c485_outtok_q6kr4: size GB 16.88, weighted score 0.8443 c328_tok_iq6k: size GB 16.89, weighted score 0.8443 c188_faqkv_iq6k: size GB 16.89, weighted score 0.8443 c426_ryspath_q8kr8: size GB 17.05, weighted score 0.8443 c575_fa_iq5ks_ryssig_q8: size GB 17.07, weighted score 0.8443 
c392_faqk_q8kr8: size GB 17.1, weighted score 0.8443 c459_latectl_q8r8: size GB 17.17, weighted score 0.8443 c738_fa_iq3kt_lateffn_bf16: size GB 17.25, weighted score 0.8443 c63_fa_q5k_outtok_q8: size GB 17.72, weighted score 0.8443 c09_union_rys_path_q8_late_control_q8: size GB 17.72, weighted score 0.8443 c714_fa_iq3kt_outtok_bf16: size GB 19.55, weighted score 0.8443 c690_fa_iq4kt_rysffn_bf16: size GB 20.27, weighted score 0.8443 c582_fa_iq5ks_rysffn_bf16: size GB 20.57, weighted score 0.8443 c757_fa_mxfp4_rysffn_iq4ks: size GB 16.4, weighted score 0.8436 c310_outtok_iq4kt: size GB 16.07, weighted score 0.8433 c359_srcpath_iq3kt: size GB 16.46, weighted score 0.8433 c441_ryssig_iq3kr4: size GB 16.46, weighted score 0.8433 c256_rysffn_iq4k: size GB 16.55, weighted score 0.8433 c235_bndall_iq5ks: size GB 16.69, weighted score 0.8433 c589_fa_iq5ks_lateffn_iq4ks: size GB 16.7, weighted score 0.8433 c661_fa_iq5k_lateffn_iq4ks: size GB 16.76, weighted score 0.8433 c258_rysffn_iq6k: size GB 17.27, weighted score 0.8433 c283_latectl_iq4ks: size GB 16.51, weighted score 0.8424 c356_srcpath_iq4kss: size GB 16.52, weighted score 0.8424 c414_faqkv_q40r8: size GB 16.54, weighted score 0.8424 c468_latectl_q6kr4: size GB 16.87, weighted score 0.8424 c112_fav_bf16_fao_bf16: size GB 17.49, weighted score 0.8424 c264_lateattn_iq4kss: size GB 16.52, weighted score 0.8416 c69_fa_q5k_bndffn_q8: size GB 17.31, weighted score 0.8407 c430_ryspath_q50r4: size GB 16.69, weighted score 0.8404 c320_out_iq4kt: size GB 16.15, weighted score 0.839 c425_ryspath_q8r8: size GB 17.11, weighted score 0.8375 c588_fa_iq5ks_bndffn_bf16: size GB 18.26, weighted score 0.8365 c96_fa_q8_lateffn_bf16: size GB 18.51, weighted score 0.8365 c255_rysffn_iq5ks: size GB 16.81, weighted score 0.8363 c455_latectl_iq5kr4: size GB 16.71, weighted score 0.8356 c390_faqk_iq3kr4: size GB 16.39, weighted score 0.8346 c447_ryssig_q50r4: size GB 16.64, weighted score 0.8344 c469_outtok_iq4ksr4: size GB 16.15, weighted 
score 0.8337 c294_finalffn_iq4kss: size GB 16.51, weighted score 0.8337 c292_latectl_mxfp4: size GB 16.51, weighted score 0.8337 c194_fav_iq4kss: size GB 16.54, weighted score 0.8337 c406_faqkv_iq4nlr4: size GB 16.54, weighted score 0.8337 c707_fa_iq4kt_finalffn_q8: size GB 16.83, weighted score 0.8337 c462_latectl_q60: size GB 16.86, weighted score 0.8337 c240_bndall_iq4kt: size GB 16.46, weighted score 0.8337 c432_ryspath_q4kr4: size GB 16.55, weighted score 0.8337 c448_ryssig_q40r8: size GB 16.55, weighted score 0.8337 c399_faqk_q5kr4: size GB 16.71, weighted score 0.8337 c490_ryspath_q8kv: size GB 17.04, weighted score 0.8337 c410_faqkv_q8kvr8: size GB 17.12, weighted score 0.8337 c126_bndall_bf16_lateffn_q8: size GB 19.05, weighted score 0.8337 c257_rysffn_iq5k: size GB 16.89, weighted score 0.8336 c710_fa_iq3kt_outtok_iq4kss: size GB 15.74, weighted score 0.8322 c400_faqk_q6kr4: size GB 16.87, weighted score 0.8298 c261_rysffn_iq3kt: size GB 16.1, weighted score 0.8281 c388_faqk_iq4xsr8: size GB 16.52, weighted score 0.8281 c202_fav_mxfp4: size GB 16.54, weighted score 0.8276 c83_fa_q6k_lateffn_q8: size GB 17.39, weighted score 0.8268 c293_finalffn_iq4ks: size GB 16.53, weighted score 0.8241 c694_fa_iq4kt_bndffn_q6k: size GB 16.7, weighted score 0.8241 c745_fa_mxfp4_outtok_iq4ks: size GB 16.08, weighted score 0.8234 c517_fa_iq4ks_lateffn_iq4ks: size GB 16.46, weighted score 0.8163 c163_fa_iq4ks: size GB 16.49, weighted score 0.8163 c186_faqkv_iq4k: size GB 16.54, weighted score 0.8163 c454_latectl_iq4kr4: size GB 16.55, weighted score 0.8163 c457_latectl_iq4nlr4: size GB 16.55, weighted score 0.8163 c297_finalffn_iq5k: size GB 16.66, weighted score 0.8163 c175_faqk_iq5ks: size GB 16.67, weighted score 0.8163 c217_ryspath_iq5k: size GB 16.69, weighted score 0.8163 c585_fa_iq5ks_bndffn_iq5ks: size GB 16.82, weighted score 0.8163 c586_fa_iq5ks_bndffn_q6k: size GB 17, weighted score 0.8163 c168_fa_iq6k: size GB 17.04, weighted score 0.8163 
c684_fa_iq4kt_ryssig_bf16: size GB 17.43, weighted score 0.8163 c374_fa_q8r8: size GB 17.49, weighted score 0.8163 c522_fa_iq4ks_lateffn_bf16: size GB 17.51, weighted score 0.8163 c594_fa_iq5ks_lateffn_bf16: size GB 17.75, weighted score 0.8163 c371_fa_iq4xsr8: size GB 16.48, weighted score 0.8135 c245_bndffn_iq5ks: size GB 16.66, weighted score 0.8135 c165_fa_iq5ks: size GB 16.72, weighted score 0.8086 c304_outtok_iq4kss: size GB 16.07, weighted score 0.8066 c415_faqkv_q4kr4: size GB 16.54, weighted score 0.8066 c357_srcpath_iq5ks: size GB 16.61, weighted score 0.8066 c185_faqkv_iq5ks: size GB 16.67, weighted score 0.8066 c402_faqkv_iq5ksr4: size GB 16.67, weighted score 0.8066 c726_fa_iq3kt_rysffn_bf16: size GB 20.06, weighted score 0.8066 c314_out_iq4kss: size GB 16.15, weighted score 0.7961 c458_latectl_iq3kr4: size GB 16.39, weighted score 0.796 c487_faqk_q8kv: size GB 17.09, weighted score 0.796 c222_ryspath_mxfp4: size GB 16.52, weighted score 0.7858 c376_fa_q8kvr8: size GB 17.37, weighted score 0.7451 c378_fa_q60r4: size GB 17.01, weighted score 0.6776 c383_fa_q6kr4: size GB 17.03, weighted score 0.6776 c486_fa_q8kv: size GB 17.37, weighted score 0.5301 c375_fa_q8kr8: size GB 17.38, weighted score 0.5266 c687_fa_iq4kt_rysffn_iq5ks: size GB 16.68, weighted score 0.3271 c488_faqkv_q8kv: size GB 17.12, weighted score 0 c581_fa_iq5ks_rysffn_q8: size GB 18.06, weighted score 0
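The entries above follow a fixed `name: size GB X, weighted score Y` pattern, so they can be turned back into rows for sorting or filtering. The following is a minimal sketch, not the tooling used to produce this page; the function name `parse_entries` and the sample string are illustrative, with the three sample entries copied from the list above.

```python
import re

# One entry looks like: "c168_fa_iq6k: size GB 17.04, weighted score 0.8163"
ENTRY = re.compile(r"(\S+): size GB ([\d.]+), weighted score ([\d.]+)")

def parse_entries(text):
    """Return (candidate, size_gb, weighted_score) tuples found in a flattened entry string."""
    return [(name, float(size), float(score)) for name, size, score in ENTRY.findall(text)]

# Three entries copied from the list above (hypothetical sample variable).
SAMPLE = (
    "c586_fa_iq5ks_bndffn_q6k: size GB 17, weighted score 0.8163 "
    "c168_fa_iq6k: size GB 17.04, weighted score 0.8163 "
    "c488_faqkv_q8kv: size GB 17.12, weighted score 0"
)

rows = parse_entries(SAMPLE)
# Highest score first; among equal scores, smallest file first.
ranked = sorted(rows, key=lambda row: (-row[2], row[1]))
for name, size_gb, score in ranked:
    print(f"{name:28} {size_gb:6.2f} GB  {score:.4f}")
```

Sorting on `(-score, size)` mirrors how this page is read in practice: many candidates tie on score, so size is the natural tie-breaker.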

Every real-world harness result

These are the project-level ClawCode harness rows. They are the closest tests to the natural use case: work inside a project, infer the files, implement, test, and repair.

Every real-world ClawCode harness result row, grouped by candidate.
Candidate                         KritaLite       SyncBoard       Legacy-codebase migration  Recipe spec
c718_fa_iq3kt_ryssig_q6k          14/15 (93.3%)   16/19 (84.2%)   19/19 (100.0%)             fa=iq3_kt, rys_signal=q6_K
unsloth_q8_0                      14/15 (93.3%)   19/19 (100.0%)  18/19 (94.7%)
c154_fao_bf16_faqk_q8_lateffn_q8  13/15 (86.7%)   18/19 (94.7%)   19/19 (100.0%)             fa_o=bf16, fa_qk=q8_0, late_ffn=q8_0
c204_fao_iq4kss                   13/15 (86.7%)   19/19 (100.0%)  8/19 (42.1%)               fa_o=iq4_kss
c311_outtok_iq3kt                 12/15 (80.0%)   not run         19/19 (100.0%)             output_token=iq3_kt
c373_fa_iq3kr4                    13/15 (86.7%)   18/19 (94.7%)   18/19 (94.7%)              fa=iq3_k_r4
c281_lateffn_iq3kt                12/15 (80.0%)   not run         18/19 (94.7%)              late_ffn=iq3_kt
c404_faqkv_iq5kr4                 not run         not run         18/19 (94.7%)              fa_qkv=iq5_k_r4
c76_fa_q6k_outtok_bf16            14/15 (93.3%)   18/19 (94.7%)   8/19 (42.1%)               fa=q6_K, output_token=bf16
unsloth_iq4nl                     14/15 (93.3%)   18/19 (94.7%)   17/19 (89.5%)
release_s010_iq4nl                12/15 (80.0%)   18/19 (94.7%)   16/19 (84.2%)
c317_out_iq5k                     not run         not run         17/19 (89.5%)              output_only=iq5_k
c05_all_fullattn_bf16             13/15 (86.7%)   not run         not run                    ^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
c431_ryspath_q40r8                not run         not run         14/19 (73.7%)              rys_path=q4_0_r8
Raw real-world run rows. Some useful agent runs had non-zero Claw return codes but passed verifier checks, so the verifier component score, not the Claw return code, is the release metric.
Suite                      Candidate                         Components  Score   Elapsed s  Pool     Context  Claw rc  Verify rc  Files  Tests
KritaLite                  c718_fa_iq3kt_ryssig_q6k          14/15       93.3%   448        high     160000   0        1          0
KritaLite                  c76_fa_q6k_outtok_bf16            14/15       93.3%   1195       small_b  160000   0        1          0
KritaLite                  unsloth_iq4nl                     14/15       93.3%   246        high     160000   0        1          0
KritaLite                  unsloth_q8_0                      14/15       93.3%   1495       small_a  160000   0        1          0
KritaLite                  c05_all_fullattn_bf16             13/15       86.7%   3187       small_a  160000   0        1          0
KritaLite                  c154_fao_bf16_faqk_q8_lateffn_q8  13/15       86.7%   779        high     160000   0        1          0
KritaLite                  c204_fao_iq4kss                   13/15       86.7%   1924       small_a  160000   0        1          0
KritaLite                  c373_fa_iq3kr4                    13/15       86.7%   837        small_a  160000   0        1          0
KritaLite                  c281_lateffn_iq3kt                12/15       80.0%   2130       small_b  160000   0        1          0
KritaLite                  c311_outtok_iq3kt                 12/15       80.0%   1933       small_b  160000   0        1          0
KritaLite                  release_s010_iq4nl                12/15       80.0%   568        high     160000   0        1          0
Legacy-codebase migration  c154_fao_bf16_faqk_q8_lateffn_q8  19/19       100.0%  7041       high     160000   1        0          5      5/5
Legacy-codebase migration  c311_outtok_iq3kt                 19/19       100.0%  10906      small_a  160000   0        0          8      6/6
Legacy-codebase migration  c718_fa_iq3kt_ryssig_q6k          19/19       100.0%  2100       high     160000   1        0          8      6/6
Legacy-codebase migration  c281_lateffn_iq3kt                18/19       94.7%   1497       small_a  160000   0        0          4      3/3
Legacy-codebase migration  c373_fa_iq3kr4                    18/19       94.7%   7742       small_b  160000   0        0          5      5/5
Legacy-codebase migration  c404_faqkv_iq5kr4                 18/19       94.7%   2045       small_a  160000   1        0          5      3/4
Legacy-codebase migration  unsloth_q8_0                      18/19       94.7%   1816       high     160000   1        0          5      3/3
Legacy-codebase migration  c317_out_iq5k                     17/19       89.5%   1841       small_b  160000   0        0          3      3/3
Legacy-codebase migration  unsloth_iq4nl                     17/19       89.5%   2062       high     160000   1        0          5      5/5
Legacy-codebase migration  release_s010_iq4nl                16/19       84.2%   1611       high     160000   1        0          5      3/3
Legacy-codebase migration  c431_ryspath_q40r8                14/19       73.7%   14400      small_b  160000   124      1          1      3/3
Legacy-codebase migration  c204_fao_iq4kss                   8/19        42.1%   4397       small_a  160000   0        1          1      3/3
Legacy-codebase migration  c76_fa_q6k_outtok_bf16            8/19        42.1%   1466       high     160000   1        1          1      3/3
SyncBoard                  c204_fao_iq4kss                   19/19       100.0%  1240       small_b  160000   0        0          0
SyncBoard                  unsloth_q8_0                      19/19       100.0%  647        high     160000   0        0          0
SyncBoard                  c154_fao_bf16_faqk_q8_lateffn_q8  18/19       94.7%   3724       small_a  160000   1        1          0
SyncBoard                  c373_fa_iq3kr4                    18/19       94.7%   348        high     160000   0        1          0
SyncBoard                  c76_fa_q6k_outtok_bf16            18/19       94.7%   601        small_a  160000   0        1          0
SyncBoard                  release_s010_iq4nl                18/19       94.7%   804        small_b  160000   0        1          0
SyncBoard                  unsloth_iq4nl                     18/19       94.7%   1145       small_a  160000   0        1          0
SyncBoard                  c718_fa_iq3kt_ryssig_q6k          16/19       84.2%   2350       high     160000   0        1          0
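
The percentage shown next to each component count is simply passed components divided by total components. A minimal sketch of that arithmetic follows, assuming one-decimal rounding as displayed on this page; the function name `component_percent` and the `RUNS` list are illustrative, and the Claw return codes are as read from the raw rows above. It also shows why a run with a non-zero Claw return code can still count fully toward the release metric.

```python
def component_percent(components):
    """Convert a 'passed/total' string such as '14/15' to a percentage with one decimal."""
    passed, total = (int(part) for part in components.split("/"))
    if total <= 0 or not 0 <= passed <= total:
        raise ValueError(f"invalid component count: {components!r}")
    return round(100 * passed / total, 1)

# (suite, candidate, components, claw_rc) taken from the raw run rows above.
RUNS = [
    ("KritaLite", "c718_fa_iq3kt_ryssig_q6k", "14/15", 0),
    ("SyncBoard", "c718_fa_iq3kt_ryssig_q6k", "16/19", 0),
    ("Legacy-codebase migration", "c718_fa_iq3kt_ryssig_q6k", "19/19", 1),
    ("Legacy-codebase migration", "c204_fao_iq4kss", "8/19", 0),
]

for suite, candidate, components, claw_rc in RUNS:
    # The score depends only on verifier components, never on claw_rc.
    score = component_percent(components)
    print(f"{suite:26} {candidate:26} {components:>5} {score:5.1f}%  claw_rc={claw_rc}")
```

In this sample the Legacy-codebase migration run for c718_fa_iq3kt_ryssig_q6k has a Claw return code of 1 yet scores 100.0%, while the c204_fao_iq4kss run exits cleanly and scores 42.1%.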

Comprehensive candidate table

This table contains all 440 rows in summary_signallatch_comprehensive_eval_latest.json, joined with recipe descriptions from the candidate catalog when available.
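
Each row's exact rules have the form `REGEX=QUANT_TYPE`, where the regex selects tensor names and the right-hand side names the quant type for them. The sketch below shows one way to read such rules; it is an illustration, not the build scripts' actual logic. The tensor names, the `"base"` placeholder for unmatched tensors, and the first-match-wins order are assumptions; the two rules are copied from the c718_fa_iq3kt_ryssig_q6k row.

```python
import re

def parse_rule(rule):
    """Split 'REGEX=QUANT_TYPE' on the last '=' and compile the regex part."""
    pattern, quant_type = rule.rsplit("=", 1)
    return re.compile(pattern), quant_type

def assign_quant(tensor_name, rules, default="base"):
    """Return the quant type of the first matching rule (assumed order), else a placeholder."""
    for pattern, quant_type in rules:
        if pattern.search(tensor_name):
            return quant_type
    return default

# Rules copied from the c718_fa_iq3kt_ryssig_q6k row of the table.
C718_RULES = [parse_rule(rule) for rule in (
    r"^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt",
    r"^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_K",
)]

# Hypothetical tensor names shaped to fit the patterns above.
for name in ("blk.15.attn_q.weight", "blk.16.ssm_out.weight", "blk.16.ffn_up.weight"):
    print(name, "->", assign_quant(name, C718_RULES))
```

Under these assumptions, a full-attention Q tensor in block 15 maps to iq3_kt, an SSM output tensor in block 16 maps to q6_K, and an FFN tensor in block 16 matches neither rule and falls through to the placeholder.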

440 comprehensive result rows. Sorted by weighted score descending. Selected public rows are highlighted.
Rank | Candidate | Weighted | Habit | Hard | Mean | Size | Errors | Recipe spec | Description / exact rules
1 c373_fa_iq3kr4 0.8935 0.8786 0.9130 0.8979 16.29 GB 0 fa=iq3_k_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_k_r4
2 c281_lateffn_iq3kt 0.8916 0.8786 0.9085 0.8954 16.43 GB 0 late_ffn=iq3_kt Mixed manual probe: late FFN gate/down control tensors at iq3_kt.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq3_kt
3 c718_fa_iq3kt_ryssig_q6k 0.8822 0.9286 0.8219 0.8685 16.40 GB 0 fa=iq3_kt, rys_signal=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; RYS 15..24 linear-attention and SSM signal path at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_K
4 c204_fao_iq4kss 0.8761 0.9143 0.8263 0.8648 16.52 GB 0 fa_o=iq4_kss Mixed manual probe: full-attention output tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq4_kss
5 c311_outtok_iq3kt 0.8741 0.9143 0.8219 0.8623 15.79 GB 0 output_token=iq3_kt Mixed manual probe: token embedding and output tensors at iq3_kt.
rules
^output\.weight=iq3_kt
^token_embd\.weight=iq3_kt
6 c317_out_iq5k 0.8741 0.9143 0.8219 0.8623 16.39 GB 0 output_only=iq5_k Mixed manual probe: output tensor at iq5_k.
rules
^output\.weight=iq5_k
7 c254_rysffn_iq4kss 0.8741 0.9143 0.8219 0.8623 16.39 GB 0 rys_ffn=iq4_kss Mixed manual probe: RYS 15..24 FFN tensors at iq4_kss.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_kss
8 c500_fa_iq4ks_ryssig_iq4kss 0.8741 0.9143 0.8219 0.8623 16.44 GB 0 fa=iq4_ks, rys_signal=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks; RYS 15..24 linear-attention and SSM signal path at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
9 c234_bndall_iq4kss 0.8741 0.9143 0.8219 0.8623 16.46 GB 0 boundary_all=iq4_kss Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq4_kss.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq4_kss
10 c253_rysffn_iq4ks 0.8741 0.9143 0.8219 0.8623 16.47 GB 0 rys_ffn=iq4_ks Mixed manual probe: RYS 15..24 FFN tensors at iq4_ks.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_ks
11 c169_fa_iq4xs 0.8741 0.9143 0.8219 0.8623 16.48 GB 0 fa=iq4_xs Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_xs.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_xs
12 c274_lateffn_iq4kss 0.8741 0.9143 0.8219 0.8623 16.51 GB 0 late_ffn=iq4_kss Mixed manual probe: late FFN gate/down control tensors at iq4_kss.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kss
13 c456_latectl_iq4xsr8 0.8741 0.9143 0.8219 0.8623 16.51 GB 0 late_control=iq4_xs_r8 Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_xs_r8.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_xs_r8
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_xs_r8
14 c213_ryspath_iq4ks 0.8741 0.9143 0.8219 0.8623 16.52 GB 0 rys_path=iq4_ks Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_ks.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks
15 c364_duppath_iq4kt 0.8741 0.9143 0.8219 0.8623 16.52 GB 0 dup_path=iq4_kt Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at iq4_kt.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kt
16 c201_fav_iq3kt 0.8741 0.9143 0.8219 0.8623 16.53 GB 0 fa_v=iq3_kt Mixed manual probe: full-attention V tensors at iq3_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq3_kt
17 c273_lateffn_iq4ks 0.8741 0.9143 0.8219 0.8623 16.53 GB 0 late_ffn=iq4_ks Mixed manual probe: late FFN gate/down control tensors at iq4_ks.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_ks
18 c200_fav_iq4kt 0.8741 0.9143 0.8219 0.8623 16.54 GB 0 fa_v=iq4_kt Mixed manual probe: full-attention V tensors at iq4_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq4_kt
19 c339_earlyattn_iq4xs 0.8741 0.9143 0.8219 0.8623 16.54 GB 0 early_attn=iq4_xs Mixed manual probe: early full-attention routing tensors at iq4_xs.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq4_xs
20 c193_fav_iq4ks 0.8741 0.9143 0.8219 0.8623 16.54 GB 0 fa_v=iq4_ks Mixed manual probe: full-attention V tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq4_ks
21 c369_fa_iq4kr4 0.8741 0.9143 0.8219 0.8623 16.54 GB 0 fa=iq4_k_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k_r4
22 c336_earlyattn_iq4k 0.8741 0.9143 0.8219 0.8623 16.55 GB 0 early_attn=iq4_k Mixed manual probe: early full-attention routing tensors at iq4_k.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq4_k
23 c423_ryspath_iq4nlr4 0.8741 0.9143 0.8219 0.8623 16.55 GB 0 rys_path=iq4_nl_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_nl_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_nl_r4
24 c431_ryspath_q40r8 0.8741 0.9143 0.8219 0.8623 16.55 GB 0 rys_path=q4_0_r8 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q4_0_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q4_0_r8
25 c206_fao_iq4k 0.8741 0.9143 0.8219 0.8623 16.55 GB 0 fa_o=iq4_k Mixed manual probe: full-attention output tensors at iq4_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq4_k
26 c226_ryssig_iq4k 0.8741 0.9143 0.8219 0.8623 16.55 GB 0 rys_signal=iq4_k Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_k.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_k
27 c397_faqk_q40r8 0.8741 0.9143 0.8219 0.8623 16.55 GB 0 fa_qk=q4_0_r8 Mixed manual probe: full-attention Q/K tensors at q4_0_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q4_0_r8
28 c205_fao_iq5ks 0.8741 0.9143 0.8219 0.8623 16.61 GB 0 fa_o=iq5_ks Mixed manual probe: full-attention output tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq5_ks
29 c433_ryspath_q5kr4 0.8741 0.9143 0.8219 0.8623 16.69 GB 0 rys_path=q5_k_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q5_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q5_k_r4
30 c649_fa_iq5k_rysffn_iq4ks 0.8741 0.9143 0.8219 0.8623 16.70 GB 0 fa=iq5_k, rys_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k; RYS 15..24 FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_ks
31 c21_fao_q6k 0.8741 0.9143 0.8219 0.8623 16.70 GB 0 fa_o=q6_K Mixed manual probe: full-attention output tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=q6_K
32 c208_fao_iq6k 0.8741 0.9143 0.8219 0.8623 16.71 GB 0 fa_o=iq6_k Mixed manual probe: full-attention output tensors at iq6_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq6_k
33 c404_faqkv_iq5kr4 0.8741 0.9143 0.8219 0.8623 16.71 GB 0 fa_qkv=iq5_k_r4 Mixed manual probe: full-attention Q/K/V tensors at iq5_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq5_k_r4
34 c610_fa_iq4k_ryssig_q6k 0.8741 0.9143 0.8219 0.8623 16.72 GB 0 fa=iq4_k, rys_signal=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k; RYS 15..24 linear-attention and SSM signal path at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_K
35 c655_fa_iq5k_bndffn_iq4ks 0.8741 0.9143 0.8219 0.8623 16.75 GB 0 fa=iq5_k, boundary_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k; RYS entry, exit, and duplicate-boundary FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_ks
36 c167_fa_iq5k 0.8741 0.9143 0.8219 0.8623 16.78 GB 0 fa=iq5_k Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
37 c370_fa_iq5kr4 0.8741 0.9143 0.8219 0.8623 16.78 GB 0 fa=iq5_k_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k_r4
38 c379_fa_q50r4 0.8741 0.9143 0.8219 0.8623 16.78 GB 0 fa=q5_0_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_0_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_0_r4
39 c22_fao_q8 0.8741 0.9143 0.8219 0.8623 16.84 GB 0 fa_o=q8_0 Mixed manual probe: full-attention output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=q8_0
40 c391_faqk_q8r8 0.8741 0.9143 0.8219 0.8623 17.17 GB 0 fa_qk=q8_0_r8 Mixed manual probe: full-attention Q/K tensors at q8_0_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0_r8
41 c616_fa_iq4k_rysffn_q6k 0.8741 0.9143 0.8219 0.8623 17.23 GB 0 fa=iq4_k, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
42 c587_fa_iq5ks_bndffn_q8 0.8741 0.9143 0.8219 0.8623 17.26 GB 0 fa=iq5_ks, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
43 c77_fa_q6k_ryssig_q8 0.8741 0.9143 0.8219 0.8623 17.38 GB 0 fa=q6_K, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
44 c666_fa_iq5k_lateffn_bf16 0.8741 0.9143 0.8219 0.8623 17.80 GB 0 fa=iq5_k, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
45 c93_fa_q8_bndffn_q8 0.8741 0.9143 0.8219 0.8623 18.02 GB 0 fa=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
46 c154_fao_bf16_faqk_q8_lateffn_q8 0.8741 0.9143 0.8219 0.8623 18.34 GB 0 fa_o=bf16, fa_qk=q8_0, late_ffn=q8_0 Mixed manual probe: full-attention output tensors at bf16; full-attention Q/K tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
47 c06_output_token_q8_fullattn_q8 0.8741 0.9143 0.8219 0.8623 18.43 GB 0 ^output\.weight=q8_0; ^token_embd\.weight=q8_0; ^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0 Protect expression/input surfaces plus full-attention routing: token/output Q8_0 and full-attention Q/K/V/O Q8_0.
rules
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
48 c05_all_fullattn_bf16 0.8741 0.9143 0.8219 0.8623 19.26 GB 0 ^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16 Full-attention ceiling probe: all full-attention Q/K/V/O tensors remain BF16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
49 c56_outtok_bf16 0.8741 0.9143 0.8219 0.8623 19.88 GB 0 output_token=bf16 Mixed manual probe: token embedding and output tensors at bf16.
rules
^output\.weight=bf16
^token_embd\.weight=bf16
50 c145_fa_bf16_outtok_q8_ryssig_q8 0.8741 0.9143 0.8219 0.8623 20.55 GB 0 fa=bf16, output_token=q8_0, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
51 c147_fa_bf16_outtok_q8_bndffn_q8 0.8741 0.9143 0.8219 0.8623 20.73 GB 0 fa=bf16, output_token=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
52 c106_fa_bf16_bndffn_bf16 0.8741 0.9143 0.8219 0.8623 20.79 GB 0 fa=bf16, boundary_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS entry, exit, and duplicate-boundary FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=bf16
53 c88_fa_q8_outtok_bf16 0.8741 0.9143 0.8219 0.8623 20.81 GB 0 fa=q8_0, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=bf16
^token_embd\.weight=bf16
54 c100_fa_bf16_outtok_bf16 0.8741 0.9143 0.8219 0.8623 22.58 GB 0 fa=bf16, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=bf16
^token_embd\.weight=bf16
55 c172_fa_mxfp4 0.8734 0.9143 0.8202 0.8614 16.48 GB 0 fa=mxfp4 Mixed manual probe: all full-attention Q/K/V/O tensors at mxfp4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=mxfp4
56 c192_faqkv_mxfp4 0.8734 0.9143 0.8202 0.8614 16.50 GB 0 fa_qkv=mxfp4 Mixed manual probe: full-attention Q/K/V tensors at mxfp4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=mxfp4
57 c239_bndall_iq4xs 0.8722 0.9143 0.8174 0.8598 16.51 GB 0 boundary_all=iq4_xs Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq4_xs.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq4_xs
58 c377_fa_q60 0.8722 0.9143 0.8174 0.8598 17.01 GB 0 fa=q6_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_0
59 c413_faqkv_q50r4 0.8715 0.9143 0.8157 0.8589 16.71 GB 0 fa_qkv=q5_0_r4 Mixed manual probe: full-attention Q/K/V tensors at q5_0_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q5_0_r4
60 c207_fao_iq5k 0.8683 0.9143 0.8085 0.8548 16.63 GB 0 fa_o=iq5_k Mixed manual probe: full-attention output tensors at iq5_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq5_k
61 c312_outtok_mxfp4 0.8678 0.8786 0.8537 0.8646 16.15 GB 0 output_token=mxfp4 Mixed manual probe: token embedding and output tensors at mxfp4.
rules
^output\.weight=mxfp4
^token_embd\.weight=mxfp4
62 c300_finalffn_iq4kt 0.8664 0.9143 0.8041 0.8523 16.51 GB 0 final_ffn=iq4_kt Mixed manual probe: final FFN tensors at iq4_kt.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq4_kt
63 c474_outtok_iq4nlr4 0.8645 0.9143 0.7996 0.8498 16.23 GB 0 output_token=iq4_nl_r4 Mixed manual probe: token embedding and output tensors at iq4_nl_r4.
rules
^output\.weight=iq4_nl_r4
^token_embd\.weight=iq4_nl_r4
64 c691_fa_iq4kt_bndffn_iq4ks 0.8645 0.9143 0.7996 0.8498 16.39 GB 0 fa=iq4_kt, boundary_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS entry, exit, and duplicate-boundary FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_ks
65 c184_faqkv_iq4kss 0.8645 0.9143 0.7996 0.8498 16.46 GB 0 fa_qkv=iq4_kss Mixed manual probe: full-attention Q/K/V tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_kss
66 c330_tok_iq4kt 0.8645 0.9143 0.7996 0.8498 16.48 GB 0 token_only=iq4_kt Mixed manual probe: token embedding tensor at iq4_kt.
rules
^token_embd\.weight=iq4_kt
67 c174_faqk_iq4kss 0.8645 0.9143 0.7996 0.8498 16.48 GB 0 fa_qk=iq4_kss Mixed manual probe: full-attention Q/K tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_kss
68 c244_bndffn_iq4kss 0.8645 0.9143 0.7996 0.8498 16.49 GB 0 boundary_ffn=iq4_kss Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq4_kss.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_kss
69 c329_tok_iq4xs 0.8645 0.9143 0.7996 0.8498 16.52 GB 0 token_only=iq4_xs Mixed manual probe: token embedding tensor at iq4_xs.
rules
^token_embd\.weight=iq4_xs
70 c219_ryspath_iq4xs 0.8645 0.9143 0.7996 0.8498 16.52 GB 0 rys_path=iq4_xs Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_xs.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_xs
71 c210_fao_iq4kt 0.8645 0.9143 0.7996 0.8498 16.52 GB 0 fa_o=iq4_kt Mixed manual probe: full-attention output tensors at iq4_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq4_kt
72 c252_bndffn_mxfp4 0.8645 0.9143 0.7996 0.8498 16.52 GB 0 boundary_ffn=mxfp4 Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at mxfp4.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=mxfp4
73 c279_lateffn_iq4xs 0.8645 0.9143 0.7996 0.8498 16.53 GB 0 late_ffn=iq4_xs Mixed manual probe: late FFN gate/down control tensors at iq4_xs.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_xs
74 c209_fao_iq4xs 0.8645 0.9143 0.7996 0.8498 16.54 GB 0 fa_o=iq4_xs Mixed manual probe: full-attention output tensors at iq4_xs.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq4_xs
75 c203_fao_iq4ks 0.8645 0.9143 0.7996 0.8498 16.54 GB 0 fa_o=iq4_ks Mixed manual probe: full-attention output tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq4_ks
76 c166_fa_iq4k 0.8645 0.9143 0.7996 0.8498 16.54 GB 0 fa=iq4_k Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
77 c472_outtok_iq5kr4 0.8645 0.9143 0.7996 0.8498 16.54 GB 0 output_token=iq5_k_r4 Mixed manual probe: token embedding and output tensors at iq5_k_r4.
rules
^output\.weight=iq5_k_r4
^token_embd\.weight=iq5_k_r4
78 c267_lateattn_iq5k 0.8645 0.9143 0.7996 0.8498 16.62 GB 0 late_attn=iq5_k Mixed manual probe: late full-attention control tensors at iq5_k.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
79 c583_fa_iq5ks_bndffn_iq4ks 0.8645 0.9143 0.7996 0.8498 16.69 GB 0 fa=iq5_ks, boundary_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS entry, exit, and duplicate-boundary FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_ks
80 c639_fa_iq5k_outtok_iq5ks 0.8645 0.9143 0.7996 0.8498 16.69 GB 0 fa=iq5_k, output_token=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k; token embedding and output tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^output\.weight=iq5_ks
^token_embd\.weight=iq5_ks
81 c464_latectl_q50r4 0.8645 0.9143 0.7996 0.8498 16.71 GB 0 late_control=q5_0_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at q5_0_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_0_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q5_0_r4
82 c467_latectl_q5kr4 0.8645 0.9143 0.7996 0.8498 16.71 GB 0 late_control=q5_k_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at q5_k_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_k_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q5_k_r4
83 c382_fa_q5kr4 0.8645 0.9143 0.7996 0.8498 16.78 GB 0 fa=q5_k_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_k_r4
84 c491_ryssig_q8kv 0.8645 0.9143 0.7996 0.8498 16.86 GB 0 rys_signal=q8_KV Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q8_KV.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_KV
85 c604_fa_iq4k_outtok_q6k 0.8645 0.9143 0.7996 0.8498 16.87 GB 0 fa=iq4_k, output_token=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k; token embedding and output tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
^output\.weight=q6_K
^token_embd\.weight=q6_K
86 c178_faqk_iq6k 0.8645 0.9143 0.7996 0.8498 16.88 GB 0 fa_qk=iq6_k Mixed manual probe: full-attention Q/K tensors at iq6_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq6_k
87 c111_fav_bf16_fao_q8 0.8645 0.9143 0.7996 0.8498 16.96 GB 0 fa_v=bf16, fa_o=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=q8_0
88 c544_fa_iq4kss_rysffn_q6k 0.8645 0.9143 0.7996 0.8498 17.12 GB 0 fa=iq4_kss, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
89 c23_fao_bf16 0.8645 0.9143 0.7996 0.8498 17.37 GB 0 fa_o=bf16 Mixed manual probe: full-attention output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
90 c564_fa_iq4kss_finalffn_bf16 0.8645 0.9143 0.7996 0.8498 17.58 GB 0 fa=iq4_kss, final_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; final FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=bf16
91 c569_fa_iq5ks_outtok_q8 0.8645 0.9143 0.7996 0.8498 17.67 GB 0 fa=iq5_ks, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^output\.weight=q8_0
^token_embd\.weight=q8_0
92 c152_fav_bf16_fao_bf16_bndffn_q8 0.8645 0.9143 0.7996 0.8498 18.03 GB 0 fa_v=bf16, fa_o=bf16, boundary_ffn=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at bf16; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
93 c151_fav_bf16_fao_bf16_outtok_q8 0.8645 0.9143 0.7996 0.8498 18.44 GB 0 fa_v=bf16, fa_o=bf16, output_token=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at bf16; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
94 c141_fa_q8_outtok_q8_bndffn_q8 0.8645 0.9143 0.7996 0.8498 18.97 GB 0 fa=q8_0, output_token=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
95 c64_fa_q5k_outtok_bf16 0.8645 0.9143 0.7996 0.8498 20.11 GB 0 fa=q5_K, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=bf16
^token_embd\.weight=bf16
96 c76_fa_q6k_outtok_bf16 0.8645 0.9143 0.7996 0.8498 20.36 GB 0 fa=q6_K, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^output\.weight=bf16
^token_embd\.weight=bf16
97 c751_fa_mxfp4_ryssig_iq4ks 0.8638 0.9143 0.7980 0.8489 16.46 GB 0 fa=mxfp4, rys_signal=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at mxfp4; RYS 15..24 linear-attention and SSM signal path at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=mxfp4
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks
98 c592_fa_iq5ks_lateffn_q6k 0.8635 0.9143 0.7974 0.8485 16.91 GB 0 fa=iq5_ks, late_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; late FFN gate/down control tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q6_K
99 c322_out_mxfp4 0.8581 0.8786 0.8315 0.8521 16.19 GB 0 output_only=mxfp4 Mixed manual probe: output tensor at mxfp4.
rules
^output\.weight=mxfp4
100 c407_faqkv_iq3kr4 0.8575 0.8786 0.8300 0.8512 16.37 GB 0 fa_qkv=iq3_k_r4 Mixed manual probe: full-attention Q/K/V tensors at iq3_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq3_k_r4
101 c214_ryspath_iq4kss 0.8575 0.8786 0.8300 0.8512 16.48 GB 0 rys_path=iq4_kss Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_kss.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
102 c190_faqkv_iq4kt 0.8559 0.8786 0.8263 0.8492 16.46 GB 0 fa_qkv=iq4_kt Mixed manual probe: full-attention Q/K/V tensors at iq4_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_kt
103 c242_bndall_mxfp4 0.8559 0.8786 0.8263 0.8492 16.51 GB 0 boundary_all=mxfp4 Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at mxfp4.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=mxfp4
104 c348_srcsig_mxfp4 0.8559 0.8786 0.8263 0.8492 16.54 GB 0 src_signal=mxfp4 Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at mxfp4.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=mxfp4
105 c450_ryssig_q5kr4 0.8559 0.8786 0.8263 0.8492 16.64 GB 0 rys_signal=q5_k_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q5_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q5_k_r4
106 c277_lateffn_iq5k 0.8559 0.8786 0.8263 0.8492 16.64 GB 0 late_ffn=iq5_k Mixed manual probe: late FFN gate/down control tensors at iq5_k.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_k
107 c590_fa_iq5ks_lateffn_iq4kss 0.8559 0.8786 0.8263 0.8492 16.68 GB 0 fa=iq5_ks, late_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; late FFN gate/down control tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kss
108 c492_lateattn_q8kv 0.8559 0.8786 0.8263 0.8492 16.78 GB 0 late_attn=q8_KV Mixed manual probe: late full-attention control tensors at q8_KV.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_KV
109 c429_ryspath_q60r4 0.8559 0.8786 0.8263 0.8492 16.83 GB 0 rys_path=q6_0_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q6_0_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_0_r4
110 c494_fa_iq4ks_outtok_iq4kss 0.8539 0.8786 0.8219 0.8467 16.00 GB 0 fa=iq4_ks, output_token=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks; token embedding and output tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^output\.weight=iq4_kss
^token_embd\.weight=iq4_kss
111 c734_fa_iq3kt_lateffn_iq4kss 0.8539 0.8786 0.8219 0.8467 16.18 GB 0 fa=iq3_kt, late_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; late FFN gate/down control tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kss
112 c171_fa_iq3kt 0.8539 0.8786 0.8219 0.8467 16.22 GB 0 fa=iq3_kt Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
113 c306_outtok_iq4k 0.8539 0.8786 0.8219 0.8467 16.23 GB 0 output_token=iq4_k Mixed manual probe: token embedding and output tensors at iq4_k.
rules
^output\.weight=iq4_k
^token_embd\.weight=iq4_k
114 c471_outtok_iq4kr4 0.8539 0.8786 0.8219 0.8467 16.23 GB 0 output_token=iq4_k_r4 Mixed manual probe: token embedding and output tensors at iq4_k_r4.
rules
^output\.weight=iq4_k_r4
^token_embd\.weight=iq4_k_r4
115 c241_bndall_iq3kt 0.8539 0.8786 0.8219 0.8467 16.30 GB 0 boundary_all=iq3_kt Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq3_kt.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq3_kt
116 c191_faqkv_iq3kt 0.8539 0.8786 0.8219 0.8467 16.32 GB 0 fa_qkv=iq3_kt Mixed manual probe: full-attention Q/K/V tensors at iq3_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq3_kt
117 c315_out_iq5ks 0.8539 0.8786 0.8219 0.8467 16.35 GB 0 output_only=iq5_ks Mixed manual probe: output tensor at iq5_ks.
rules
^output\.weight=iq5_ks
118 c692_fa_iq4kt_bndffn_iq4kss 0.8539 0.8786 0.8219 0.8467 16.36 GB 0 fa=iq4_kt, boundary_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS entry, exit, and duplicate-boundary FFN tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_kss
119 c221_ryspath_iq3kt 0.8539 0.8786 0.8219 0.8467 16.36 GB 0 rys_path=iq3_kt Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq3_kt.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_kt
120 c251_bndffn_iq3kt 0.8539 0.8786 0.8219 0.8467 16.37 GB 0 boundary_ffn=iq3_kt Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq3_kt.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq3_kt
121 c505_fa_iq4ks_rysffn_iq4ks 0.8539 0.8786 0.8219 0.8467 16.40 GB 0 fa=iq4_ks, rys_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks; RYS 15..24 FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_ks
122 c164_fa_iq4kss 0.8539 0.8786 0.8219 0.8467 16.43 GB 0 fa=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
123 c170_fa_iq4kt 0.8539 0.8786 0.8219 0.8467 16.43 GB 0 fa=iq4_kt Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
124 c524_fa_iq4ks_finalffn_iq4kss 0.8539 0.8786 0.8219 0.8467 16.44 GB 0 fa=iq4_ks, final_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks; final FFN tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq4_kss
125 c231_ryssig_iq3kt 0.8539 0.8786 0.8219 0.8467 16.44 GB 0 rys_signal=iq3_kt Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq3_kt.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_kt
126 c271_lateattn_iq3kt 0.8539 0.8786 0.8219 0.8467 16.46 GB 0 late_attn=iq3_kt Mixed manual probe: late full-attention control tensors at iq3_kt.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
127 c470_outtok_iq5ksr4 0.8539 0.8786 0.8219 0.8467 16.47 GB 0 output_token=iq5_ks_r4 Mixed manual probe: token embedding and output tensors at iq5_ks_r4.
rules
^output\.weight=iq5_ks_r4
^token_embd\.weight=iq5_ks_r4
128 c259_rysffn_iq4xs 0.8539 0.8786 0.8219 0.8467 16.47 GB 0 rys_ffn=iq4_xs Mixed manual probe: RYS 15..24 FFN tensors at iq4_xs.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_xs
129 c290_latectl_iq4kt 0.8539 0.8786 0.8219 0.8467 16.47 GB 0 late_control=iq4_kt Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_kt.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kt
130 c180_faqk_iq4kt 0.8539 0.8786 0.8219 0.8467 16.48 GB 0 fa_qk=iq4_kt Mixed manual probe: full-attention Q/K tensors at iq4_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_kt
131 c220_ryspath_iq4kt 0.8539 0.8786 0.8219 0.8467 16.48 GB 0 rys_path=iq4_kt Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_kt.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kt
132 c250_bndffn_iq4kt 0.8539 0.8786 0.8219 0.8467 16.49 GB 0 boundary_ffn=iq4_kt Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq4_kt.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_kt
133 c353_dupsig_iq3kt 0.8539 0.8786 0.8219 0.8467 16.50 GB 0 dup_signal=iq3_kt Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at iq3_kt.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_kt
134 c189_faqkv_iq4xs 0.8539 0.8786 0.8219 0.8467 16.50 GB 0 fa_qkv=iq4_xs Mixed manual probe: full-attention Q/K/V tensors at iq4_xs.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_xs
135 c183_faqkv_iq4ks 0.8539 0.8786 0.8219 0.8467 16.50 GB 0 fa_qkv=iq4_ks Mixed manual probe: full-attention Q/K/V tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_ks
136 c233_bndall_iq4ks 0.8539 0.8786 0.8219 0.8467 16.51 GB 0 boundary_all=iq4_ks Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq4_ks.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq4_ks
137 c280_lateffn_iq4kt 0.8539 0.8786 0.8219 0.8467 16.51 GB 0 late_ffn=iq4_kt Mixed manual probe: late FFN gate/down control tensors at iq4_kt.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kt
138 c224_ryssig_iq4kss 0.8539 0.8786 0.8219 0.8467 16.51 GB 0 rys_signal=iq4_kss Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_kss.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
139 c230_ryssig_iq4kt 0.8539 0.8786 0.8219 0.8467 16.51 GB 0 rys_signal=iq4_kt Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_kt.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kt
140 c289_latectl_iq4xs 0.8539 0.8786 0.8219 0.8467 16.51 GB 0 late_control=iq4_xs Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_xs.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_xs
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_xs
141 c332_tok_mxfp4 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 token_only=mxfp4 Mixed manual probe: token embedding tensor at mxfp4.
rules
^token_embd\.weight=mxfp4
142 c323_tok_iq4ks 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 token_only=iq4_ks Mixed manual probe: token embedding tensor at iq4_ks.
rules
^token_embd\.weight=iq4_ks
143 c182_faqk_mxfp4 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 fa_qk=mxfp4 Mixed manual probe: full-attention Q/K tensors at mxfp4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=mxfp4
144 c422_ryspath_iq4xsr8 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 rys_path=iq4_xs_r8 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_xs_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_xs_r8
145 c173_faqk_iq4ks 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 fa_qk=iq4_ks Mixed manual probe: full-attention Q/K tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_ks
146 c384_faqk_iq4ksr4 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 fa_qk=iq4_ks_r4 Mixed manual probe: full-attention Q/K tensors at iq4_ks_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_ks_r4
147 c418_ryspath_iq4ksr4 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 rys_path=iq4_ks_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_ks_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks_r4
148 c358_srcpath_iq4kt 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 src_path=iq4_kt Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at iq4_kt.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kt
149 c270_lateattn_iq4kt 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 late_attn=iq4_kt Mixed manual probe: late full-attention control tensors at iq4_kt.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
150 c243_bndffn_iq4ks 0.8539 0.8786 0.8219 0.8467 16.52 GB 0 boundary_ffn=iq4_ks Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq4_ks.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_ks
151 c334_earlyattn_iq4kss 0.8539 0.8786 0.8219 0.8467 16.53 GB 0 early_attn=iq4_kss Mixed manual probe: early full-attention routing tensors at iq4_kss.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq4_kss
152 c340_earlyattn_iq4kt 0.8539 0.8786 0.8219 0.8467 16.53 GB 0 early_attn=iq4_kt Mixed manual probe: early full-attention routing tensors at iq4_kt.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq4_kt
153 c299_finalffn_iq4xs 0.8539 0.8786 0.8219 0.8467 16.53 GB 0 final_ffn=iq4_xs Mixed manual probe: final FFN tensors at iq4_xs.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq4_xs
154 c229_ryssig_iq4xs 0.8539 0.8786 0.8219 0.8467 16.53 GB 0 rys_signal=iq4_xs Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_xs.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_xs
155 c350_dupsig_iq4kss 0.8539 0.8786 0.8219 0.8467 16.53 GB 0 dup_signal=iq4_kss Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at iq4_kss.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
156 c435_ryssig_iq4ksr4 0.8539 0.8786 0.8219 0.8467 16.53 GB 0 rys_signal=iq4_ks_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_ks_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks_r4
157 c269_lateattn_iq4xs 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 late_attn=iq4_xs Mixed manual probe: late full-attention control tensors at iq4_xs.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_xs
158 c366_duppath_mxfp4 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 dup_path=mxfp4 Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at mxfp4.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=mxfp4
159 c333_earlyattn_iq4ks 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 early_attn=iq4_ks Mixed manual probe: early full-attention routing tensors at iq4_ks.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq4_ks
160 c196_fav_iq4k 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 fa_v=iq4_k Mixed manual probe: full-attention V tensors at iq4_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq4_k
161 c354_dupsig_mxfp4 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 dup_signal=mxfp4 Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at mxfp4.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=mxfp4
162 c349_dupsig_iq4ks 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 dup_signal=iq4_ks Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at iq4_ks.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks
163 c307_outtok_iq5k 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 output_token=iq5_k Mixed manual probe: token embedding and output tensors at iq5_k.
rules
^output\.weight=iq5_k
^token_embd\.weight=iq5_k
164 c481_outtok_q50r4 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 output_token=q5_0_r4 Mixed manual probe: token embedding and output tensors at q5_0_r4.
rules
^output\.weight=q5_0_r4
^token_embd\.weight=q5_0_r4
165 c484_outtok_q5kr4 0.8539 0.8786 0.8219 0.8467 16.54 GB 0 output_token=q5_k_r4 Mixed manual probe: token embedding and output tensors at q5_k_r4.
rules
^output\.weight=q5_k_r4
^token_embd\.weight=q5_k_r4
166 c465_latectl_q40r8 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 late_control=q4_0_r8 Mixed manual probe: late attention plus FFN gate/down control tensors at q4_0_r8.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q4_0_r8
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q4_0_r8
167 c216_ryspath_iq4k 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 rys_path=iq4_k Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_k.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_k
168 c236_bndall_iq4k 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 boundary_all=iq4_k Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq4_k.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq4_k
169 c176_faqk_iq4k 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 fa_qk=iq4_k Mixed manual probe: full-attention Q/K tensors at iq4_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_k
170 c197_fav_iq5k 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 fa_v=iq5_k Mixed manual probe: full-attention V tensors at iq5_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq5_k
171 c386_faqk_iq4kr4 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 fa_qk=iq4_k_r4 Mixed manual probe: full-attention Q/K tensors at iq4_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_k_r4
172 c398_faqk_q4kr4 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 fa_qk=q4_k_r4 Mixed manual probe: full-attention Q/K tensors at q4_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q4_k_r4
173 c440_ryssig_iq4nlr4 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 rys_signal=iq4_nl_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_nl_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_nl_r4
174 c449_ryssig_q4kr4 0.8539 0.8786 0.8219 0.8467 16.55 GB 0 rys_signal=q4_k_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q4_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q4_k_r4
175 c318_out_iq6k 0.8539 0.8786 0.8219 0.8467 16.56 GB 0 output_only=iq6_k Mixed manual probe: output tensor at iq6_k.
rules
^output\.weight=iq6_k
176 c198_fav_iq6k 0.8539 0.8786 0.8219 0.8467 16.57 GB 0 fa_v=iq6_k Mixed manual probe: full-attention V tensors at iq6_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq6_k
177 c345_srcsig_iq5ks 0.8539 0.8786 0.8219 0.8467 16.59 GB 0 src_signal=iq5_ks Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at iq5_ks.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks
178 c335_earlyattn_iq5ks 0.8539 0.8786 0.8219 0.8467 16.59 GB 0 early_attn=iq5_ks Mixed manual probe: early full-attention routing tensors at iq5_ks.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq5_ks
179 c627_fa_iq4k_lateffn_iq5ks 0.8539 0.8786 0.8219 0.8467 16.61 GB 0 fa=iq4_k, late_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k; late FFN gate/down control tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_ks
180 c633_fa_iq4k_finalffn_iq5ks 0.8539 0.8786 0.8219 0.8467 16.62 GB 0 fa=iq4_k, final_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k; final FFN tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq5_ks
181 c225_ryssig_iq5ks 0.8539 0.8786 0.8219 0.8467 16.62 GB 0 rys_signal=iq5_ks Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq5_ks.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks
182 c436_ryssig_iq5ksr4 0.8539 0.8786 0.8219 0.8467 16.62 GB 0 rys_signal=iq5_ks_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq5_ks_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks_r4
183 c621_fa_iq4k_bndffn_iq5ks 0.8539 0.8786 0.8219 0.8467 16.64 GB 0 fa=iq4_k, boundary_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_k; RYS entry, exit, and duplicate-boundary FFN tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq5_ks
184 c338_earlyattn_iq6k 0.8539 0.8786 0.8219 0.8467 16.66 GB 0 early_attn=iq6_k Mixed manual probe: early full-attention routing tensors at iq6_k.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq6_k
185 c453_latectl_iq5ksr4 0.8539 0.8786 0.8219 0.8467 16.67 GB 0 late_control=iq5_ks_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at iq5_ks_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_ks_r4
186 c385_faqk_iq5ksr4 0.8539 0.8786 0.8219 0.8467 16.67 GB 0 fa_qk=iq5_ks_r4 Mixed manual probe: full-attention Q/K tensors at iq5_ks_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq5_ks_r4
187 c421_ryspath_iq5kr4 0.8539 0.8786 0.8219 0.8467 16.69 GB 0 rys_path=iq5_k_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq5_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_k_r4
188 c187_faqkv_iq5k 0.8539 0.8786 0.8219 0.8467 16.71 GB 0 fa_qkv=iq5_k Mixed manual probe: full-attention Q/K/V tensors at iq5_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq5_k
189 c387_faqk_iq5kr4 0.8539 0.8786 0.8219 0.8467 16.71 GB 0 fa_qk=iq5_k_r4 Mixed manual probe: full-attention Q/K tensors at iq5_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq5_k_r4
190 c327_tok_iq5k 0.8539 0.8786 0.8219 0.8467 16.71 GB 0 token_only=iq5_k Mixed manual probe: token embedding tensor at iq5_k.
rules
^token_embd\.weight=iq5_k
191 c368_fa_iq5ksr4 0.8539 0.8786 0.8219 0.8467 16.72 GB 0 fa=iq5_ks_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks_r4
192 c237_bndall_iq5k 0.8539 0.8786 0.8219 0.8467 16.74 GB 0 boundary_all=iq5_k Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq5_k.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq5_k
193 c228_ryssig_iq6k 0.8539 0.8786 0.8219 0.8467 16.74 GB 0 rys_signal=iq6_k Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq6_k.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq6_k
194 c278_lateffn_iq6k 0.8539 0.8786 0.8219 0.8467 16.74 GB 0 late_ffn=iq6_k Mixed manual probe: late FFN gate/down control tensors at iq6_k.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq6_k
195 c676_fa_iq4kt_outtok_q6k 0.8539 0.8786 0.8219 0.8467 16.75 GB 0 fa=iq4_kt, output_token=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; token embedding and output tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^output\.weight=q6_K
^token_embd\.weight=q6_K
196 c539_fa_iq4kss_ryssig_q8 0.8539 0.8786 0.8219 0.8467 16.77 GB 0 fa=iq4_kss, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
197 c428_ryspath_q60 0.8539 0.8786 0.8219 0.8467 16.83 GB 0 rys_path=q6_0 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q6_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_0
198 c248_bndffn_iq6k 0.8539 0.8786 0.8219 0.8467 16.84 GB 0 boundary_ffn=iq6_k Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq6_k.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq6_k
199 c434_ryspath_q6kr4 0.8539 0.8786 0.8219 0.8467 16.84 GB 0 rys_path=q6_k_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q6_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_k_r4
200 c218_ryspath_iq6k 0.8539 0.8786 0.8219 0.8467 16.85 GB 0 rys_path=iq6_k Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq6_k.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq6_k
201 c444_ryssig_q8kvr8 0.8539 0.8786 0.8219 0.8467 16.86 GB 0 rys_signal=q8_KV_r8 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q8_KV_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_KV_r8
202 c463_latectl_q60r4 0.8539 0.8786 0.8219 0.8467 16.86 GB 0 late_control=q6_0_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at q6_0_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_0_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q6_0_r4
203 c395_faqk_q60r4 0.8539 0.8786 0.8219 0.8467 16.86 GB 0 fa_qk=q6_0_r4 Mixed manual probe: full-attention Q/K tensors at q6_0_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q6_0_r4
204 c480_outtok_q60r4 0.8539 0.8786 0.8219 0.8467 16.86 GB 0 output_token=q6_0_r4 Mixed manual probe: token embedding and output tensors at q6_0_r4.
rules
^output\.weight=q6_0_r4
^token_embd\.weight=q6_0_r4
205 c443_ryssig_q8kr8 0.8539 0.8786 0.8219 0.8467 16.86 GB 0 rys_signal=q8_k_r8 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q8_k_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_k_r8
206 c411_faqkv_q60 0.8539 0.8786 0.8219 0.8467 16.87 GB 0 fa_qkv=q6_0 Mixed manual probe: full-attention Q/K/V tensors at q6_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q6_0
207 c412_faqkv_q60r4 0.8539 0.8786 0.8219 0.8467 16.87 GB 0 fa_qkv=q6_0_r4 Mixed manual probe: full-attention Q/K/V tensors at q6_0_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q6_0_r4
208 c288_latectl_iq6k 0.8539 0.8786 0.8219 0.8467 16.88 GB 0 late_control=iq6_k Mixed manual probe: late attention plus FFN gate/down control tensors at iq6_k.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq6_k
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq6_k
209 c417_faqkv_q6kr4 0.8539 0.8786 0.8219 0.8467 16.88 GB 0 fa_qkv=q6_k_r4 Mixed manual probe: full-attention Q/K/V tensors at q6_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q6_k_r4
210 c442_ryssig_q8r8 0.8539 0.8786 0.8219 0.8467 16.90 GB 0 rys_signal=q8_0_r8 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q8_0_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0_r8
211 c308_outtok_iq6k 0.8539 0.8786 0.8219 0.8467 16.90 GB 0 output_token=iq6_k Mixed manual probe: token embedding and output tensors at iq6_k.
rules
^output\.weight=iq6_k
^token_embd\.weight=iq6_k
212 c35_bndall_q6k 0.8539 0.8786 0.8219 0.8467 16.94 GB 0 boundary_all=q6_K Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at q6_K.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q6_K
213 c238_bndall_iq6k 0.8539 0.8786 0.8219 0.8467 16.95 GB 0 boundary_all=iq6_k Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq6_k.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq6_k
214 c427_ryspath_q8kvr8 0.8539 0.8786 0.8219 0.8467 17.04 GB 0 rys_path=q8_KV_r8 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q8_KV_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_KV_r8
215 c461_latectl_q8kvr8 0.8539 0.8786 0.8219 0.8467 17.09 GB 0 late_control=q8_KV_r8 Mixed manual probe: late attention plus FFN gate/down control tensors at q8_KV_r8.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_KV_r8
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_KV_r8
216 c460_latectl_q8kr8 0.8539 0.8786 0.8219 0.8467 17.10 GB 0 late_control=q8_k_r8 Mixed manual probe: late attention plus FFN gate/down control tensors at q8_k_r8.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_k_r8
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_k_r8
217 c688_fa_iq4kt_rysffn_q6k 0.8539 0.8786 0.8219 0.8467 17.12 GB 0 fa=iq4_kt, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
218 c599_fa_iq5ks_finalffn_q8 0.8539 0.8786 0.8219 0.8467 17.12 GB 0 fa=iq5_ks, final_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; final FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q8_0
219 c409_faqkv_q8kr8 0.8539 0.8786 0.8219 0.8467 17.13 GB 0 fa_qkv=q8_k_r8 Mixed manual probe: full-attention Q/K/V tensors at q8_k_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q8_k_r8
220 c408_faqkv_q8r8 0.8539 0.8786 0.8219 0.8467 17.20 GB 0 fa_qkv=q8_0_r8 Mixed manual probe: full-attention Q/K/V tensors at q8_0_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q8_0_r8
221 c03_habit_boundary_q8 0.8539 0.8786 0.8219 0.8467 17.30 GB 0 ^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q8_0 Protect the RYS entry/exit boundaries and duplicated boundary layers with Q8_0 across attention, SSM, and FFN tensors.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q8_0
222 c45_lateattn_bf16 0.8539 0.8786 0.8219 0.8467 17.31 GB 0 late_attn=bf16 Mixed manual probe: late full-attention control tensors at bf16.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
223 c478_outtok_q8kvr8 0.8539 0.8786 0.8219 0.8467 17.34 GB 0 output_token=q8_KV_r8 Mixed manual probe: token embedding and output tensors at q8_KV_r8.
rules
^output\.weight=q8_KV_r8
^token_embd\.weight=q8_KV_r8
224 c580_fa_iq5ks_rysffn_q6k 0.8539 0.8786 0.8219 0.8467 17.41 GB 0 fa=iq5_ks, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
225 c477_outtok_q8kr8 0.8539 0.8786 0.8219 0.8467 17.43 GB 0 output_token=q8_k_r8 Mixed manual probe: token embedding and output tensors at q8_k_r8.
rules
^output\.weight=q8_k_r8
^token_embd\.weight=q8_k_r8
226 c702_fa_iq4kt_lateffn_bf16 0.8539 0.8786 0.8219 0.8467 17.45 GB 0 fa=iq4_kt, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
227 c476_outtok_q8r8 0.8539 0.8786 0.8219 0.8467 17.50 GB 0 output_token=q8_0_r8 Mixed manual probe: token embedding and output tensors at q8_0_r8.
rules
^output\.weight=q8_0_r8
^token_embd\.weight=q8_0_r8
228 c689_fa_iq4kt_rysffn_q8 0.8539 0.8786 0.8219 0.8467 17.76 GB 0 fa=iq4_kt, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
229 c89_fa_q8_ryssig_q8 0.8539 0.8786 0.8219 0.8467 17.83 GB 0 fa=q8_0, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
230 c672_fa_iq5k_finalffn_bf16 0.8539 0.8786 0.8219 0.8467 17.93 GB 0 fa=iq5_k, final_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k; final FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=bf16
231 c114_fao_bf16_faqk_q8 0.8539 0.8786 0.8219 0.8467 17.98 GB 0 fa_o=bf16, fa_qk=q8_0 Mixed manual probe: full-attention output tensors at bf16; full-attention Q/K tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0
232 c15_faqk_bf16 0.8539 0.8786 0.8219 0.8467 18.32 GB 0 fa_qk=bf16 Mixed manual probe: full-attention Q/K tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=bf16
233 c87_fa_q8_outtok_q8 0.8539 0.8786 0.8219 0.8467 18.43 GB 0 fa=q8_0, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
234 c101_fa_bf16_ryssig_q8 0.8539 0.8786 0.8219 0.8467 19.60 GB 0 fa=bf16, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
235 c534_fa_iq4kss_outtok_bf16 0.8539 0.8786 0.8219 0.8467 19.75 GB 0 fa=iq4_kss, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^output\.weight=bf16
^token_embd\.weight=bf16
236 c105_fa_bf16_bndffn_q8 0.8539 0.8786 0.8219 0.8467 19.79 GB 0 fa=bf16, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
237 c99_fa_bf16_outtok_q8 0.8539 0.8786 0.8219 0.8467 20.20 GB 0 fa=bf16, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
238 c102_fa_bf16_ryssig_bf16 0.8539 0.8786 0.8219 0.8467 20.26 GB 0 fa=bf16, rys_signal=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS 15..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
239 c319_out_iq4xs 0.8539 0.9143 0.7752 0.8360 16.19 GB 0 output_only=iq4_xs Mixed manual probe: output tensor at iq4_xs.
rules
^output\.weight=iq4_xs
240 c644_fa_iq5k_ryssig_iq4kss 0.8539 0.9143 0.7752 0.8360 16.74 GB 0 fa=iq5_k, rys_signal=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k; RYS 15..24 linear-attention and SSM signal path at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
241 c142_fa_q8_outtok_q8_lateffn_q8 0.8539 0.9143 0.7752 0.8360 18.79 GB 0 fa=q8_0, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
242 c148_fa_bf16_outtok_q8_lateffn_q8 0.8539 0.9143 0.7752 0.8360 20.56 GB 0 fa=bf16, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
243 c211_fao_iq3kt 0.8524 0.8929 0.7996 0.8404 16.46 GB 0 fa_o=iq3_kt Mixed manual probe: full-attention output tensors at iq3_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=iq3_kt
244 c466_latectl_q4kr4 0.8520 0.8786 0.8174 0.8442 16.55 GB 0 late_control=q4_k_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at q4_k_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q4_k_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q4_k_r4
245 c298_finalffn_iq6k 0.8520 0.8786 0.8174 0.8442 16.77 GB 0 final_ffn=iq6_k Mixed manual probe: final FFN tensors at iq6_k.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq6_k
246 c722_fa_iq3kt_rysffn_iq4kss 0.8497 0.8929 0.7935 0.8370 16.05 GB 0 fa=iq3_kt, rys_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; RYS 15..24 FFN tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_kss
247 c262_rysffn_mxfp4 0.8471 0.8786 0.8061 0.8378 16.47 GB 0 rys_ffn=mxfp4 Mixed manual probe: RYS 15..24 FFN tensors at mxfp4.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=mxfp4
248 c309_outtok_iq4xs 0.8462 0.8786 0.8041 0.8367 16.15 GB 0 output_token=iq4_xs Mixed manual probe: token embedding and output tensors at iq4_xs.
rules
^output\.weight=iq4_xs
^token_embd\.weight=iq4_xs
249 c482_outtok_q40r8 0.8462 0.8786 0.8041 0.8367 16.23 GB 0 output_token=q4_0_r8 Mixed manual probe: token embedding and output tensors at q4_0_r8.
rules
^output\.weight=q4_0_r8
^token_embd\.weight=q4_0_r8
250 c424_ryspath_iq3kr4 0.8462 0.8786 0.8041 0.8367 16.40 GB 0 rys_path=iq3_k_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq3_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_k_r4
251 c302_finalffn_mxfp4 0.8462 0.8786 0.8041 0.8367 16.53 GB 0 final_ffn=mxfp4 Mixed manual probe: final FFN tensors at mxfp4.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=mxfp4
252 c282_lateffn_mxfp4 0.8462 0.8786 0.8041 0.8367 16.53 GB 0 late_ffn=mxfp4 Mixed manual probe: late FFN gate/down control tensors at mxfp4.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=mxfp4
253 c232_ryssig_mxfp4 0.8462 0.8786 0.8041 0.8367 16.53 GB 0 rys_signal=mxfp4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at mxfp4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=mxfp4
254 c381_fa_q4kr4 0.8462 0.8786 0.8041 0.8367 16.54 GB 0 fa=q4_k_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at q4_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q4_k_r4
255 c195_fav_iq5ks 0.8462 0.8786 0.8041 0.8367 16.55 GB 0 fa_v=iq5_ks Mixed manual probe: full-attention V tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq5_ks
256 c419_ryspath_iq5ksr4 0.8462 0.8786 0.8041 0.8367 16.66 GB 0 rys_path=iq5_ks_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq5_ks_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks_r4
257 c321_out_iq3kt 0.8460 0.8786 0.8035 0.8364 16.01 GB 0 output_only=iq3_kt Mixed manual probe: output tensor at iq3_kt.
rules
^output\.weight=iq3_kt
258 c346_srcsig_iq4kt 0.8460 0.8786 0.8035 0.8364 16.53 GB 0 src_signal=iq4_kt Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at iq4_kt.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kt
259 c295_finalffn_iq5ks 0.8460 0.8786 0.8035 0.8364 16.63 GB 0 final_ffn=iq5_ks Mixed manual probe: final FFN tensors at iq5_ks.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq5_ks
260 c287_latectl_iq5k 0.8452 0.8786 0.8019 0.8354 16.71 GB 0 late_control=iq5_k Mixed manual probe: late attention plus FFN gate/down control tensors at iq5_k.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_k
261 c591_fa_iq5ks_lateffn_iq5ks 0.8452 0.8786 0.8019 0.8354 16.79 GB 0 fa=iq5_ks, late_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; late FFN gate/down control tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_ks
262 c393_faqk_q8kvr8 0.8452 0.8786 0.8019 0.8354 17.09 GB 0 fa_qk=q8_KV_r8 Mixed manual probe: full-attention Q/K tensors at q8_KV_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_KV_r8
263 c475_outtok_iq3kr4 0.8443 0.8786 0.7996 0.8342 15.89 GB 0 output_token=iq3_k_r4 Mixed manual probe: token embedding and output tensors at iq3_k_r4.
rules
^output\.weight=iq3_k_r4
^token_embd\.weight=iq3_k_r4
264 c529_fa_iq4kss_outtok_iq4ks 0.8443 0.8786 0.7996 0.8342 16.02 GB 0 fa=iq4_kss, output_token=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; token embedding and output tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^output\.weight=iq4_ks
^token_embd\.weight=iq4_ks
265 c473_outtok_iq4xsr8 0.8443 0.8786 0.7996 0.8342 16.15 GB 0 output_token=iq4_xs_r8 Mixed manual probe: token embedding and output tensors at iq4_xs_r8.
rules
^output\.weight=iq4_xs_r8
^token_embd\.weight=iq4_xs_r8
266 c303_outtok_iq4ks 0.8443 0.8786 0.7996 0.8342 16.15 GB 0 output_token=iq4_ks Mixed manual probe: token embedding and output tensors at iq4_ks.
rules
^output\.weight=iq4_ks
^token_embd\.weight=iq4_ks
267 c313_out_iq4ks 0.8443 0.8786 0.7996 0.8342 16.19 GB 0 output_only=iq4_ks Mixed manual probe: output tensor at iq4_ks.
rules
^output\.weight=iq4_ks
268 c316_out_iq4k 0.8443 0.8786 0.7996 0.8342 16.23 GB 0 output_only=iq4_k Mixed manual probe: output tensor at iq4_k.
rules
^output\.weight=iq4_k
269 c483_outtok_q4kr4 0.8443 0.8786 0.7996 0.8342 16.23 GB 0 output_token=q4_k_r4 Mixed manual probe: token embedding and output tensors at q4_k_r4.
rules
^output\.weight=q4_k_r4
^token_embd\.weight=q4_k_r4
270 c741_fa_iq3kt_finalffn_iq5ks 0.8443 0.8786 0.7996 0.8342 16.30 GB 0 fa=iq3_kt, final_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; final FFN tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq5_ks
271 c331_tok_iq3kt 0.8443 0.8786 0.7996 0.8342 16.34 GB 0 token_only=iq3_kt Mixed manual probe: token embedding tensor at iq3_kt.
rules
^token_embd\.weight=iq3_kt
272 c291_latectl_iq3kt 0.8443 0.8786 0.7996 0.8342 16.34 GB 0 late_control=iq3_kt Mixed manual probe: late attention plus FFN gate/down control tensors at iq3_kt.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq3_kt
273 c181_faqk_iq3kt 0.8443 0.8786 0.7996 0.8342 16.34 GB 0 fa_qk=iq3_kt Mixed manual probe: full-attention Q/K tensors at iq3_kt.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq3_kt
274 c554_fa_iq4kss_lateffn_iq4kss 0.8443 0.8786 0.7996 0.8342 16.38 GB 0 fa=iq4_kss, late_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; late FFN gate/down control tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kss
275 c698_fa_iq4kt_lateffn_iq4kss 0.8443 0.8786 0.7996 0.8342 16.38 GB 0 fa=iq4_kt, late_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; late FFN gate/down control tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kss
276 c680_fa_iq4kt_ryssig_iq4kss 0.8443 0.8786 0.7996 0.8342 16.38 GB 0 fa=iq4_kt, rys_signal=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS 15..24 linear-attention and SSM signal path at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
277 c260_rysffn_iq4kt 0.8443 0.8786 0.7996 0.8342 16.39 GB 0 rys_ffn=iq4_kt Mixed manual probe: RYS 15..24 FFN tensors at iq4_kt.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_kt
278 c559_fa_iq4kss_finalffn_iq4ks 0.8443 0.8786 0.7996 0.8342 16.40 GB 0 fa=iq4_kss, final_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; final FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq4_ks
279 c301_finalffn_iq3kt 0.8443 0.8786 0.7996 0.8342 16.42 GB 0 final_ffn=iq3_kt Mixed manual probe: final FFN tensors at iq3_kt.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq3_kt
280 c511_fa_iq4ks_bndffn_iq4ks 0.8443 0.8786 0.7996 0.8342 16.45 GB 0 fa=iq4_ks, boundary_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks; RYS entry, exit, and duplicate-boundary FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_ks
281 c365_duppath_iq3kt 0.8443 0.8786 0.7996 0.8342 16.46 GB 0 dup_path=iq3_kt Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at iq3_kt.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_kt
282 c305_outtok_iq5ks 0.8443 0.8786 0.7996 0.8342 16.47 GB 0 output_token=iq5_ks Mixed manual probe: token embedding and output tensors at iq5_ks.
rules
^output\.weight=iq5_ks
^token_embd\.weight=iq5_ks
283 c284_latectl_iq4kss 0.8443 0.8786 0.7996 0.8342 16.47 GB 0 late_control=iq4_kss Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_kss.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_kss
284 c324_tok_iq4kss 0.8443 0.8786 0.7996 0.8342 16.48 GB 0 token_only=iq4_kss Mixed manual probe: token embedding tensor at iq4_kss.
rules
^token_embd\.weight=iq4_kss
285 c341_earlyattn_iq3kt 0.8443 0.8786 0.7996 0.8342 16.48 GB 0 early_attn=iq3_kt Mixed manual probe: early full-attention routing tensors at iq3_kt.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq3_kt
286 c367_fa_iq4ksr4 0.8443 0.8786 0.7996 0.8342 16.49 GB 0 fa=iq4_ks_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks_r4
287 c347_srcsig_iq3kt 0.8443 0.8786 0.7996 0.8342 16.50 GB 0 src_signal=iq3_kt Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at iq3_kt.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_kt
288 c730_fa_iq3kt_bndffn_q6k 0.8443 0.8786 0.7996 0.8342 16.50 GB 0 fa=iq3_kt, boundary_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; RYS entry, exit, and duplicate-boundary FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q6_K
289 c405_faqkv_iq4xsr8 0.8443 0.8786 0.7996 0.8342 16.50 GB 0 fa_qkv=iq4_xs_r8 Mixed manual probe: full-attention Q/K/V tensors at iq4_xs_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_xs_r8
290 c401_faqkv_iq4ksr4 0.8443 0.8786 0.7996 0.8342 16.50 GB 0 fa_qkv=iq4_ks_r4 Mixed manual probe: full-attention Q/K/V tensors at iq4_ks_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_ks_r4
291 c452_latectl_iq4ksr4 0.8443 0.8786 0.7996 0.8342 16.51 GB 0 late_control=iq4_ks_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_ks_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_ks_r4
292 c179_faqk_iq4xs 0.8443 0.8786 0.7996 0.8342 16.52 GB 0 fa_qk=iq4_xs Mixed manual probe: full-attention Q/K tensors at iq4_xs.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_xs
293 c362_duppath_iq4kss 0.8443 0.8786 0.7996 0.8342 16.52 GB 0 dup_path=iq4_kss Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at iq4_kss.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
294 c249_bndffn_iq4xs 0.8443 0.8786 0.7996 0.8342 16.52 GB 0 boundary_ffn=iq4_xs Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq4_xs.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_xs
295 c549_fa_iq4kss_bndffn_iq5ks 0.8443 0.8786 0.7996 0.8342 16.53 GB 0 fa=iq4_kss, boundary_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kss; RYS entry, exit, and duplicate-boundary FFN tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq5_ks
296 c439_ryssig_iq4xsr8 0.8443 0.8786 0.7996 0.8342 16.53 GB 0 rys_signal=iq4_xs_r8 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_xs_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_xs_r8
297 c344_srcsig_iq4kss 0.8443 0.8786 0.7996 0.8342 16.53 GB 0 src_signal=iq4_kss Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at iq4_kss.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
298 c352_dupsig_iq4kt 0.8443 0.8786 0.7996 0.8342 16.53 GB 0 dup_signal=iq4_kt Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at iq4_kt.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kt
299 c223_ryssig_iq4ks 0.8443 0.8786 0.7996 0.8342 16.53 GB 0 rys_signal=iq4_ks Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_ks.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks
300 c272_lateattn_mxfp4 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 late_attn=mxfp4 Mixed manual probe: late full-attention control tensors at mxfp4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=mxfp4
301 c263_lateattn_iq4ks 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 late_attn=iq4_ks Mixed manual probe: late full-attention control tensors at iq4_ks.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
302 c360_srcpath_mxfp4 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 src_path=mxfp4 Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at mxfp4.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=mxfp4
303 c355_srcpath_iq4ks 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 src_path=iq4_ks Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at iq4_ks.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks
304 c361_duppath_iq4ks 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 dup_path=iq4_ks Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at iq4_ks.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks
305 c212_fao_mxfp4 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 fa_o=mxfp4 Mixed manual probe: full-attention output tensors at mxfp4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=mxfp4
306 c342_earlyattn_mxfp4 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 early_attn=mxfp4 Mixed manual probe: early full-attention routing tensors at mxfp4.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=mxfp4
307 c199_fav_iq4xs 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 fa_v=iq4_xs Mixed manual probe: full-attention V tensors at iq4_xs.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq4_xs
308 c372_fa_iq4nlr4 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 fa=iq4_nl_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_nl_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_nl_r4
309 c380_fa_q40r8 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 fa=q4_0_r8 Mixed manual probe: all full-attention Q/K/V/O tensors at q4_0_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q4_0_r8
310 c403_faqkv_iq4kr4 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 fa_qkv=iq4_k_r4 Mixed manual probe: full-attention Q/K/V tensors at iq4_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_k_r4
311 c343_srcsig_iq4ks 0.8443 0.8786 0.7996 0.8342 16.54 GB 0 src_signal=iq4_ks Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at iq4_ks.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_ks
312 c266_lateattn_iq4k 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 late_attn=iq4_k Mixed manual probe: late full-attention control tensors at iq4_k.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
313 c286_latectl_iq4k 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 late_control=iq4_k Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_k.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_k
314 c420_ryspath_iq4kr4 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 rys_path=iq4_k_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq4_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_k_r4
315 c246_bndffn_iq4k 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 boundary_ffn=iq4_k Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq4_k.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_k
316 c276_lateffn_iq4k 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 late_ffn=iq4_k Mixed manual probe: late FFN gate/down control tensors at iq4_k.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_k
317 c296_finalffn_iq4k 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 final_ffn=iq4_k Mixed manual probe: final FFN tensors at iq4_k.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq4_k
318 c326_tok_iq4k 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 token_only=iq4_k Mixed manual probe: token embedding tensor at iq4_k.
rules
^token_embd\.weight=iq4_k
319 c389_faqk_iq4nlr4 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 fa_qk=iq4_nl_r4 Mixed manual probe: full-attention Q/K tensors at iq4_nl_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_nl_r4
320 c437_ryssig_iq4kr4 0.8443 0.8786 0.7996 0.8342 16.55 GB 0 rys_signal=iq4_k_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq4_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_k_r4
321 c489_fav_q8kv 0.8443 0.8786 0.7996 0.8342 16.58 GB 0 fa_v=q8_KV Mixed manual probe: full-attention V tensors at q8_KV.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=q8_KV
322 c351_dupsig_iq5ks 0.8443 0.8786 0.7996 0.8342 16.59 GB 0 dup_signal=iq5_ks Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at iq5_ks.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks
323 c265_lateattn_iq5ks 0.8443 0.8786 0.7996 0.8342 16.60 GB 0 late_attn=iq5_ks Mixed manual probe: late full-attention control tensors at iq5_ks.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
324 c337_earlyattn_iq5k 0.8443 0.8786 0.7996 0.8342 16.60 GB 0 early_attn=iq5_k Mixed manual probe: early full-attention routing tensors at iq5_k.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=iq5_k
325 c363_duppath_iq5ks 0.8443 0.8786 0.7996 0.8342 16.61 GB 0 dup_path=iq5_ks Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at iq5_ks.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks
326 c275_lateffn_iq5ks 0.8443 0.8786 0.7996 0.8342 16.62 GB 0 late_ffn=iq5_ks Mixed manual probe: late FFN gate/down control tensors at iq5_ks.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_ks
327 c227_ryssig_iq5k 0.8443 0.8786 0.7996 0.8342 16.64 GB 0 rys_signal=iq5_k Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq5_k.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_k
328 c438_ryssig_iq5kr4 0.8443 0.8786 0.7996 0.8342 16.64 GB 0 rys_signal=iq5_k_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq5_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_k_r4
329 c584_fa_iq5ks_bndffn_iq4kss 0.8443 0.8786 0.7996 0.8342 16.66 GB 0 fa=iq5_ks, boundary_ffn=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS entry, exit, and duplicate-boundary FFN tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq4_kss
330 c215_ryspath_iq5ks 0.8443 0.8786 0.7996 0.8342 16.66 GB 0 rys_path=iq5_ks Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq5_ks.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks
331 c285_latectl_iq5ks 0.8443 0.8786 0.7996 0.8342 16.67 GB 0 late_control=iq5_ks Mixed manual probe: late attention plus FFN gate/down control tensors at iq5_ks.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_ks
332 c325_tok_iq5ks 0.8443 0.8786 0.7996 0.8342 16.68 GB 0 token_only=iq5_ks Mixed manual probe: token embedding tensor at iq5_ks.
rules
^token_embd\.weight=iq5_ks
333 c247_bndffn_iq5k 0.8443 0.8786 0.7996 0.8342 16.69 GB 0 boundary_ffn=iq5_k Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq5_k.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq5_k
334 c268_lateattn_iq6k 0.8443 0.8786 0.7996 0.8342 16.69 GB 0 late_attn=iq6_k Mixed manual probe: late full-attention control tensors at iq6_k.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq6_k
335 c177_faqk_iq5k 0.8443 0.8786 0.7996 0.8342 16.71 GB 0 fa_qk=iq5_k Mixed manual probe: full-attention Q/K tensors at iq5_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq5_k
336 c396_faqk_q50r4 0.8443 0.8786 0.7996 0.8342 16.71 GB 0 fa_qk=q5_0_r4 Mixed manual probe: full-attention Q/K tensors at q5_0_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q5_0_r4
337 c416_faqkv_q5kr4 0.8443 0.8786 0.7996 0.8342 16.71 GB 0 fa_qkv=q5_k_r4 Mixed manual probe: full-attention Q/K/V tensors at q5_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q5_k_r4
338 c445_ryssig_q60 0.8443 0.8786 0.7996 0.8342 16.73 GB 0 rys_signal=q6_0 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q6_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_0
339 c446_ryssig_q60r4 0.8443 0.8786 0.7996 0.8342 16.73 GB 0 rys_signal=q6_0_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q6_0_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_0_r4
340 c451_ryssig_q6kr4 0.8443 0.8786 0.7996 0.8342 16.73 GB 0 rys_signal=q6_k_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q6_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_k_r4
341 c394_faqk_q60 0.8443 0.8786 0.7996 0.8342 16.86 GB 0 fa_qk=q6_0 Mixed manual probe: full-attention Q/K tensors at q6_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q6_0
342 c479_outtok_q60 0.8443 0.8786 0.7996 0.8342 16.86 GB 0 output_token=q6_0 Mixed manual probe: token embedding and output tensors at q6_0.
rules
^output\.weight=q6_0
^token_embd\.weight=q6_0
343 c485_outtok_q6kr4 0.8443 0.8786 0.7996 0.8342 16.88 GB 0 output_token=q6_k_r4 Mixed manual probe: token embedding and output tensors at q6_k_r4.
rules
^output\.weight=q6_k_r4
^token_embd\.weight=q6_k_r4
344 c328_tok_iq6k 0.8443 0.8786 0.7996 0.8342 16.89 GB 0 token_only=iq6_k Mixed manual probe: token embedding tensor at iq6_k.
rules
^token_embd\.weight=iq6_k
345 c188_faqkv_iq6k 0.8443 0.8786 0.7996 0.8342 16.89 GB 0 fa_qkv=iq6_k Mixed manual probe: full-attention Q/K/V tensors at iq6_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq6_k
346 c426_ryspath_q8kr8 0.8443 0.8786 0.7996 0.8342 17.05 GB 0 rys_path=q8_k_r8 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q8_k_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_k_r8
347 c575_fa_iq5ks_ryssig_q8 0.8443 0.8786 0.7996 0.8342 17.07 GB 0 fa=iq5_ks, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
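Reading note: two-rule recipes such as this one touch some of the same block indices (15, 19, 20, 24) but different tensor families. The sketch below checks, over a hypothetical list of tensor names, that no name is claimed by both rules. The tensor names and the 0..68 block range are assumptions for illustration; how the quantizer resolves a genuine overlap is not stated on this page and is not modeled here.

```python
import re

# Rules copied from row 347 (c575_fa_iq5ks_ryssig_q8).
rules = [
    r"^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks",
    r"^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0",
]
compiled = []
for rule in rules:
    pattern, _, quant = rule.rpartition("=")
    compiled.append((re.compile(pattern), quant))

# Hypothetical tensor names: every suffix on every block, whether or not
# the real model has that tensor in that block.
suffixes = ["attn_q", "attn_k", "attn_v", "attn_output",
            "attn_qkv", "attn_gate", "ssm_out", "ssm_alpha", "ssm_beta"]
names = [f"blk.{b}.{s}.weight" for b in range(69) for s in suffixes]

# For each name, list the quant types of every rule that matches it.
assignments = {n: [q for rx, q in compiled if rx.search(n)] for n in names}
overlaps = [n for n, quants in assignments.items() if len(quants) > 1]

print(len(overlaps))
print(assignments["blk.15.attn_q.weight"], assignments["blk.15.attn_qkv.weight"])
```

The point worth noticing is that attn_(q|k|v|output)\.weight requires a literal dot right after the group, so it does not also match attn_qkv.weight; the two rules stay disjoint even on shared blocks.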
348 c392_faqk_q8kr8 0.8443 0.8786 0.7996 0.8342 17.10 GB 0 fa_qk=q8_k_r8 Mixed manual probe: full-attention Q/K tensors at q8_k_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_k_r8
349 c459_latectl_q8r8 0.8443 0.8786 0.7996 0.8342 17.17 GB 0 late_control=q8_0_r8 Mixed manual probe: late attention plus FFN gate/down control tensors at q8_0_r8.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0_r8
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0_r8
350 c738_fa_iq3kt_lateffn_bf16 0.8443 0.8786 0.7996 0.8342 17.25 GB 0 fa=iq3_kt, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
351 c63_fa_q5k_outtok_q8 0.8443 0.8786 0.7996 0.8342 17.72 GB 0 fa=q5_K, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
352 c09_union_rys_path_q8_late_control_q8 0.8443 0.8786 0.7996 0.8342 17.72 GB 0 ^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0; ^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0; ^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0 Union of RYS-window path retention and late execution-control retention.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
353 c714_fa_iq3kt_outtok_bf16 0.8443 0.8786 0.7996 0.8342 19.55 GB 0 fa=iq3_kt, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^output\.weight=bf16
^token_embd\.weight=bf16
354 c690_fa_iq4kt_rysffn_bf16 0.8443 0.8786 0.7996 0.8342 20.27 GB 0 fa=iq4_kt, rys_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS 15..24 FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=bf16
355 c582_fa_iq5ks_rysffn_bf16 0.8443 0.8786 0.7996 0.8342 20.57 GB 0 fa=iq5_ks, rys_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS 15..24 FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=bf16
356 c757_fa_mxfp4_rysffn_iq4ks 0.8436 0.8786 0.7980 0.8332 16.40 GB 0 fa=mxfp4, rys_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at mxfp4; RYS 15..24 FFN tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=mxfp4
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_ks
357 c310_outtok_iq4kt 0.8433 0.8786 0.7974 0.8329 16.07 GB 0 output_token=iq4_kt Mixed manual probe: token embedding and output tensors at iq4_kt.
rules
^output\.weight=iq4_kt
^token_embd\.weight=iq4_kt
358 c359_srcpath_iq3kt 0.8433 0.8786 0.7974 0.8329 16.46 GB 0 src_path=iq3_kt Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at iq3_kt.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_kt
359 c441_ryssig_iq3kr4 0.8433 0.8786 0.7974 0.8329 16.46 GB 0 rys_signal=iq3_k_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at iq3_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq3_k_r4
360 c256_rysffn_iq4k 0.8433 0.8786 0.7974 0.8329 16.55 GB 0 rys_ffn=iq4_k Mixed manual probe: RYS 15..24 FFN tensors at iq4_k.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq4_k
361 c235_bndall_iq5ks 0.8433 0.8786 0.7974 0.8329 16.69 GB 0 boundary_all=iq5_ks Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq5_ks.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq5_ks
362 c589_fa_iq5ks_lateffn_iq4ks 0.8433 0.8786 0.7974 0.8329 16.70 GB 0 fa=iq5_ks, late_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; late FFN gate/down control tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_ks
363 c661_fa_iq5k_lateffn_iq4ks 0.8433 0.8786 0.7974 0.8329 16.76 GB 0 fa=iq5_k, late_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_k; late FFN gate/down control tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_ks
364 c258_rysffn_iq6k 0.8433 0.8786 0.7974 0.8329 17.27 GB 0 rys_ffn=iq6_k Mixed manual probe: RYS 15..24 FFN tensors at iq6_k.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq6_k
365 c283_latectl_iq4ks 0.8424 0.8786 0.7952 0.8317 16.51 GB 0 late_control=iq4_ks Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_ks.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_ks
366 c356_srcpath_iq4kss 0.8424 0.8786 0.7952 0.8317 16.52 GB 0 src_path=iq4_kss Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at iq4_kss.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq4_kss
367 c414_faqkv_q40r8 0.8424 0.8786 0.7952 0.8317 16.54 GB 0 fa_qkv=q4_0_r8 Mixed manual probe: full-attention Q/K/V tensors at q4_0_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q4_0_r8
368 c468_latectl_q6kr4 0.8424 0.8786 0.7952 0.8317 16.87 GB 0 late_control=q6_k_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at q6_k_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_k_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q6_k_r4
369 c112_fav_bf16_fao_bf16 0.8424 0.8786 0.7952 0.8317 17.49 GB 0 fa_v=bf16, fa_o=bf16 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
370 c264_lateattn_iq4kss 0.8416 0.8786 0.7935 0.8307 16.52 GB 0 late_attn=iq4_kss Mixed manual probe: late full-attention control tensors at iq4_kss.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kss
371 c69_fa_q5k_bndffn_q8 0.8407 0.8786 0.7915 0.8296 17.31 GB 0 fa=q5_K, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
372 c430_ryspath_q50r4 0.8404 0.8786 0.7907 0.8292 16.69 GB 0 rys_path=q5_0_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q5_0_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q5_0_r4
373 c320_out_iq4kt 0.8390 0.8786 0.7874 0.8273 16.15 GB 0 output_only=iq4_kt Mixed manual probe: output tensor at iq4_kt.
rules
^output\.weight=iq4_kt
374 c425_ryspath_q8r8 0.8375 0.8786 0.7841 0.8254 17.11 GB 0 rys_path=q8_0_r8 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q8_0_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0_r8
375 c588_fa_iq5ks_bndffn_bf16 0.8365 0.9143 0.7352 0.8135 18.26 GB 0 fa=iq5_ks, boundary_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS entry, exit, and duplicate-boundary FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=bf16
376 c96_fa_q8_lateffn_bf16 0.8365 0.9143 0.7352 0.8135 18.51 GB 0 fa=q8_0, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
377 c255_rysffn_iq5ks 0.8363 0.8786 0.7813 0.8239 16.81 GB 0 rys_ffn=iq5_ks Mixed manual probe: RYS 15..24 FFN tensors at iq5_ks.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq5_ks
378 c455_latectl_iq5kr4 0.8356 0.8786 0.7796 0.8229 16.71 GB 0 late_control=iq5_k_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at iq5_k_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_k_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq5_k_r4
379 c390_faqk_iq3kr4 0.8346 0.8786 0.7774 0.8217 16.39 GB 0 fa_qk=iq3_k_r4 Mixed manual probe: full-attention Q/K tensors at iq3_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq3_k_r4
380 c447_ryssig_q50r4 0.8344 0.8786 0.7769 0.8214 16.64 GB 0 rys_signal=q5_0_r4 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q5_0_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q5_0_r4
381 c469_outtok_iq4ksr4 0.8337 0.8429 0.8219 0.8310 16.15 GB 0 output_token=iq4_ks_r4 Mixed manual probe: token embedding and output tensors at iq4_ks_r4.
rules
^output\.weight=iq4_ks_r4
^token_embd\.weight=iq4_ks_r4
382 c294_finalffn_iq4kss 0.8337 0.8429 0.8219 0.8310 16.51 GB 0 final_ffn=iq4_kss Mixed manual probe: final FFN tensors at iq4_kss.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq4_kss
383 c292_latectl_mxfp4 0.8337 0.8429 0.8219 0.8310 16.51 GB 0 late_control=mxfp4 Mixed manual probe: late attention plus FFN gate/down control tensors at mxfp4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=mxfp4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=mxfp4
384 c194_fav_iq4kss 0.8337 0.8429 0.8219 0.8310 16.54 GB 0 fa_v=iq4_kss Mixed manual probe: full-attention V tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=iq4_kss
385 c406_faqkv_iq4nlr4 0.8337 0.8429 0.8219 0.8310 16.54 GB 0 fa_qkv=iq4_nl_r4 Mixed manual probe: full-attention Q/K/V tensors at iq4_nl_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_nl_r4
386 c707_fa_iq4kt_finalffn_q8 0.8337 0.8429 0.8219 0.8310 16.83 GB 0 fa=iq4_kt, final_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; final FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q8_0
387 c462_latectl_q60 0.8337 0.8429 0.8219 0.8310 16.86 GB 0 late_control=q6_0 Mixed manual probe: late attention plus FFN gate/down control tensors at q6_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q6_0
388 c240_bndall_iq4kt 0.8337 0.8786 0.7752 0.8204 16.46 GB 0 boundary_all=iq4_kt Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at iq4_kt.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=iq4_kt
389 c432_ryspath_q4kr4 0.8337 0.8786 0.7752 0.8204 16.55 GB 0 rys_path=q4_k_r4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q4_k_r4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q4_k_r4
390 c448_ryssig_q40r8 0.8337 0.8786 0.7752 0.8204 16.55 GB 0 rys_signal=q4_0_r8 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q4_0_r8.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q4_0_r8
391 c399_faqk_q5kr4 0.8337 0.8786 0.7752 0.8204 16.71 GB 0 fa_qk=q5_k_r4 Mixed manual probe: full-attention Q/K tensors at q5_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q5_k_r4
392 c490_ryspath_q8kv 0.8337 0.8786 0.7752 0.8204 17.04 GB 0 rys_path=q8_KV Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q8_KV.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_KV
393 c410_faqkv_q8kvr8 0.8337 0.8786 0.7752 0.8204 17.12 GB 0 fa_qkv=q8_KV_r8 Mixed manual probe: full-attention Q/K/V tensors at q8_KV_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q8_KV_r8
394 c126_bndall_bf16_lateffn_q8 0.8337 0.8786 0.7752 0.8204 19.05 GB 0 boundary_all=bf16, late_ffn=q8_0 Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at bf16; late FFN gate/down control tensors at q8_0.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
395 c257_rysffn_iq5k 0.8336 0.8614 0.7974 0.8254 16.89 GB 0 rys_ffn=iq5_k Mixed manual probe: RYS 15..24 FFN tensors at iq5_k.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq5_k
396 c710_fa_iq3kt_outtok_iq4kss 0.8322 0.8571 0.7996 0.8248 15.74 GB 0 fa=iq3_kt, output_token=iq4_kss Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; token embedding and output tensors at iq4_kss.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^output\.weight=iq4_kss
^token_embd\.weight=iq4_kss
397 c400_faqk_q6kr4 0.8298 0.8786 0.7663 0.8154 16.87 GB 0 fa_qk=q6_k_r4 Mixed manual probe: full-attention Q/K tensors at q6_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q6_k_r4
398 c261_rysffn_iq3kt 0.8281 0.8500 0.7996 0.8217 16.10 GB 0 rys_ffn=iq3_kt Mixed manual probe: RYS 15..24 FFN tensors at iq3_kt.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq3_kt
399 c388_faqk_iq4xsr8 0.8281 0.8500 0.7996 0.8217 16.52 GB 0 fa_qk=iq4_xs_r8 Mixed manual probe: full-attention Q/K tensors at iq4_xs_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq4_xs_r8
400 c202_fav_mxfp4 0.8276 0.8429 0.8078 0.8231 16.54 GB 0 fa_v=mxfp4 Mixed manual probe: full-attention V tensors at mxfp4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=mxfp4
401 c83_fa_q6k_lateffn_q8 0.8268 0.9143 0.7130 0.8010 17.39 GB 0 fa=q6_K, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
402 c293_finalffn_iq4ks 0.8241 0.8429 0.7996 0.8185 16.53 GB 0 final_ffn=iq4_ks Mixed manual probe: final FFN tensors at iq4_ks.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq4_ks
403 c694_fa_iq4kt_bndffn_q6k 0.8241 0.8429 0.7996 0.8185 16.70 GB 0 fa=iq4_kt, boundary_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS entry, exit, and duplicate-boundary FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q6_K
404 c745_fa_mxfp4_outtok_iq4ks 0.8234 0.8429 0.7980 0.8176 16.08 GB 0 fa=mxfp4, output_token=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at mxfp4; token embedding and output tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=mxfp4
^output\.weight=iq4_ks
^token_embd\.weight=iq4_ks
405 c517_fa_iq4ks_lateffn_iq4ks 0.8163 0.8786 0.7352 0.7979 16.46 GB 0 fa=iq4_ks, late_ffn=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks; late FFN gate/down control tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_ks
406 c163_fa_iq4ks 0.8163 0.8786 0.7352 0.7979 16.49 GB 0 fa=iq4_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
407 c186_faqkv_iq4k 0.8163 0.8786 0.7352 0.7979 16.54 GB 0 fa_qkv=iq4_k Mixed manual probe: full-attention Q/K/V tensors at iq4_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq4_k
408 c454_latectl_iq4kr4 0.8163 0.8786 0.7352 0.7979 16.55 GB 0 late_control=iq4_k_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_k_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_k_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_k_r4
409 c457_latectl_iq4nlr4 0.8163 0.8786 0.7352 0.7979 16.55 GB 0 late_control=iq4_nl_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at iq4_nl_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_nl_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq4_nl_r4
410 c297_finalffn_iq5k 0.8163 0.8786 0.7352 0.7979 16.66 GB 0 final_ffn=iq5_k Mixed manual probe: final FFN tensors at iq5_k.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=iq5_k
411 c175_faqk_iq5ks 0.8163 0.8786 0.7352 0.7979 16.67 GB 0 fa_qk=iq5_ks Mixed manual probe: full-attention Q/K tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=iq5_ks
412 c217_ryspath_iq5k 0.8163 0.8786 0.7352 0.7979 16.69 GB 0 rys_path=iq5_k Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at iq5_k.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_k
413 c585_fa_iq5ks_bndffn_iq5ks 0.8163 0.8786 0.7352 0.7979 16.82 GB 0 fa=iq5_ks, boundary_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS entry, exit, and duplicate-boundary FFN tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq5_ks
414 c586_fa_iq5ks_bndffn_q6k 0.8163 0.8786 0.7352 0.7979 17.00 GB 0 fa=iq5_ks, boundary_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS entry, exit, and duplicate-boundary FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q6_K
415 c168_fa_iq6k 0.8163 0.8786 0.7352 0.7979 17.04 GB 0 fa=iq6_k Mixed manual probe: all full-attention Q/K/V/O tensors at iq6_k.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq6_k
416 c684_fa_iq4kt_ryssig_bf16 0.8163 0.8786 0.7352 0.7979 17.43 GB 0 fa=iq4_kt, rys_signal=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS 15..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
417 c374_fa_q8r8 0.8163 0.8786 0.7352 0.7979 17.49 GB 0 fa=q8_0_r8 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0_r8
418 c522_fa_iq4ks_lateffn_bf16 0.8163 0.8786 0.7352 0.7979 17.51 GB 0 fa=iq4_ks, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_ks; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
419 c594_fa_iq5ks_lateffn_bf16 0.8163 0.8786 0.7352 0.7979 17.75 GB 0 fa=iq5_ks, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
420 c371_fa_iq4xsr8 0.8135 0.8429 0.7752 0.8048 16.48 GB 0 fa=iq4_xs_r8 Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_xs_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_xs_r8
421 c245_bndffn_iq5ks 0.8135 0.8429 0.7752 0.8048 16.66 GB 0 boundary_ffn=iq5_ks Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at iq5_ks.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=iq5_ks
422 c165_fa_iq5ks 0.8086 0.8786 0.7174 0.7879 16.72 GB 0 fa=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
423 c304_outtok_iq4kss 0.8066 0.8786 0.7130 0.7854 16.07 GB 0 output_token=iq4_kss Mixed manual probe: token embedding and output tensors at iq4_kss.
rules
^output\.weight=iq4_kss
^token_embd\.weight=iq4_kss
424 c415_faqkv_q4kr4 0.8066 0.8786 0.7130 0.7854 16.54 GB 0 fa_qkv=q4_k_r4 Mixed manual probe: full-attention Q/K/V tensors at q4_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q4_k_r4
425 c357_srcpath_iq5ks 0.8066 0.8786 0.7130 0.7854 16.61 GB 0 src_path=iq5_ks Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at iq5_ks.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=iq5_ks
426 c185_faqkv_iq5ks 0.8066 0.8786 0.7130 0.7854 16.67 GB 0 fa_qkv=iq5_ks Mixed manual probe: full-attention Q/K/V tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq5_ks
427 c402_faqkv_iq5ksr4 0.8066 0.8786 0.7130 0.7854 16.67 GB 0 fa_qkv=iq5_ks_r4 Mixed manual probe: full-attention Q/K/V tensors at iq5_ks_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=iq5_ks_r4
428 c726_fa_iq3kt_rysffn_bf16 0.8066 0.8786 0.7130 0.7854 20.06 GB 0 fa=iq3_kt, rys_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at iq3_kt; RYS 15..24 FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_kt
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=bf16
429 c314_out_iq4kss 0.7961 0.8429 0.7352 0.7823 16.15 GB 0 output_only=iq4_kss Mixed manual probe: output tensor at iq4_kss.
rules
^output\.weight=iq4_kss
430 c458_latectl_iq3kr4 0.7960 0.8786 0.6885 0.7717 16.39 GB 0 late_control=iq3_k_r4 Mixed manual probe: late attention plus FFN gate/down control tensors at iq3_k_r4.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq3_k_r4
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=iq3_k_r4
431 c487_faqk_q8kv 0.7960 0.8786 0.6885 0.7717 17.09 GB 0 fa_qk=q8_KV Mixed manual probe: full-attention Q/K tensors at q8_KV.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_KV
432 c222_ryspath_mxfp4 0.7858 0.8571 0.6930 0.7648 16.52 GB 0 rys_path=mxfp4 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at mxfp4.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=mxfp4
433 c376_fa_q8kvr8 0.7451 0.8786 0.6863 0.7704 17.37 GB 1 fa=q8_KV_r8 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_KV_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_KV_r8
434 c378_fa_q60r4 0.6776 0.9143 0.5996 0.7373 17.01 GB 2 fa=q6_0_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_0_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_0_r4
435 c383_fa_q6kr4 0.6776 0.9143 0.5996 0.7373 17.03 GB 2 fa=q6_k_r4 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_k_r4.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_k_r4
436 c486_fa_q8kv 0.5301 0.7357 0.6078 0.6637 17.37 GB 3 fa=q8_KV Mixed manual probe: all full-attention Q/K/V/O tensors at q8_KV.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_KV
437 c375_fa_q8kr8 0.5266 0.7357 0.5996 0.6592 17.38 GB 3 fa=q8_k_r8 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_k_r8.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_k_r8
438 c687_fa_iq4kt_rysffn_iq5ks 0.3271 0.6286 0.5100 0.5619 16.68 GB 5 fa=iq4_kt, rys_ffn=iq5_ks Mixed manual probe: all full-attention Q/K/V/O tensors at iq4_kt; RYS 15..24 FFN tensors at iq5_ks.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq4_kt
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=iq5_ks
439 c488_faqkv_q8kv 0.0000 0.0000 0.3774 0.2123 17.12 GB 11 fa_qkv=q8_KV Mixed manual probe: full-attention Q/K/V tensors at q8_KV.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q8_KV
440 c581_fa_iq5ks_rysffn_q8 0.0000 0.1286 0.4167 0.2906 18.06 GB 11 fa=iq5_ks, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at iq5_ks; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
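
Each line under a `rules` label in these tables has the form `<regex>=<quant type>`. The following is a minimal Python sketch of how such a rule can be read and checked against tensor names. The tensor names used here (`blk.15.attn_q.weight` and so on) are hypothetical examples in the shape the regexes imply, and the first-match-wins lookup is an assumption for illustration; this page does not state the quantization tool's actual matching semantics.

```python
import re

def parse_rule(line):
    """Split '<regex>=<quant>' at the last '=' and compile the regex part."""
    pattern, quant = line.rsplit("=", 1)
    return re.compile(pattern), quant

def quant_for(tensor_name, rules, default=None):
    """Return the quant type of the first matching rule (assumed semantics)."""
    for regex, quant in rules:
        if regex.search(tensor_name):
            return quant
    return default

# Rules copied verbatim from row 440 (c581_fa_iq5ks_rysffn_q8).
rules = [parse_rule(r) for r in [
    r"^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=iq5_ks",
    r"^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0",
]]

# Hypothetical tensor names, used only to exercise the patterns.
print(quant_for("blk.15.attn_q.weight", rules))    # iq5_ks
print(quant_for("blk.22.ffn_down.weight", rules))  # q8_0
print(quant_for("blk.25.ffn_down.weight", rules))  # None: block 25 is outside 15..24
print(quant_for("blk.16.attn_q.weight", rules))    # None: block 16 is not in the full-attention list
```

Tensors that match no rule return `default` here; in the recipes above, such tensors would presumably stay at the base mix, which this sketch does not model.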

Stable promotion-screen table

This table contains all 163 rows in the stable promotion summary used for the 163-run / 1,956-prompt accounting.
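
For readers who want to re-sort or filter the rows below, here is a minimal Python sketch that parses the leading fields of one flattened row. The field names mirror the table's column headers; the helper name `parse_row` and the regex are illustrative assumptions, not part of the release tooling. The final check reflects an observation from the rows shown in this table (Mean agrees with the simple average of Habit and Hard to within rounding), not a documented formula.

```python
import re

# Rank, candidate, four 4-decimal scores, size in GB, error count, then free text.
ROW = re.compile(
    r"^(?P<rank>\d+) (?P<candidate>\S+) "
    r"(?P<weighted>\d\.\d{4}) (?P<habit>\d\.\d{4}) "
    r"(?P<hard>\d\.\d{4}) (?P<mean>\d\.\d{4}) "
    r"(?P<size_gb>\d+\.\d+) GB (?P<errors>\d+) (?P<rest>.*)$"
)

def parse_row(line):
    """Parse one result row; return None for 'rules' labels and regex lines."""
    m = ROW.match(line)
    if m is None:
        return None
    row = m.groupdict()
    row["rank"] = int(row["rank"])
    row["errors"] = int(row["errors"])
    for key in ("weighted", "habit", "hard", "mean", "size_gb"):
        row[key] = float(row[key])
    return row

# Row 2 of the stable promotion-screen table, copied verbatim.
line = ("2 c21_fao_q6k 0.8385 0.9167 0.7328 0.8247 16.70 GB 0 "
        "fa_o=q6_K Mixed manual probe: full-attention output tensors at q6_K.")
row = parse_row(line)
print(row["rank"], row["candidate"], row["errors"])

# Observed relationship, allowing for 4-decimal rounding of the inputs.
mean_gap = abs((row["habit"] + row["hard"]) / 2 - row["mean"])
print(mean_gap < 2e-4)
```

The `rest` field keeps the recipe spec and description together, because the flattened text has no reliable delimiter between those two columns.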

163 stable promotion-screen result rows. Sorted by weighted score descending. Selected public rows are highlighted.
Rank Candidate Weighted Habit Hard Mean Size Errors Recipe spec Description / exact rules
1 c76_fa_q6k_outtok_bf16 0.8413 0.9167 0.7394 0.8281 20.36 GB 0 fa=q6_K, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^output\.weight=bf16
^token_embd\.weight=bf16
2 c21_fao_q6k 0.8385 0.9167 0.7328 0.8247 16.70 GB 0 fa_o=q6_K Mixed manual probe: full-attention output tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=q6_K
3 c22_fao_q8 0.8385 0.9167 0.7328 0.8247 16.84 GB 0 fa_o=q8_0 Mixed manual probe: full-attention output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=q8_0
4 c69_fa_q5k_bndffn_q8 0.8385 0.9167 0.7328 0.8247 17.31 GB 0 fa=q5_K, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
5 c23_fao_bf16 0.8385 0.9167 0.7328 0.8247 17.37 GB 0 fa_o=bf16 Mixed manual probe: full-attention output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
6 c112_fav_bf16_fao_bf16 0.8385 0.9167 0.7328 0.8247 17.49 GB 0 fa_v=bf16, fa_o=bf16 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
7 c89_fa_q8_ryssig_q8 0.8385 0.9167 0.7328 0.8247 17.83 GB 0 fa=q8_0, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
8 c114_fao_bf16_faqk_q8 0.8385 0.9167 0.7328 0.8247 17.98 GB 0 fa_o=bf16, fa_qk=q8_0 Mixed manual probe: full-attention output tensors at bf16; full-attention Q/K tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0
9 c93_fa_q8_bndffn_q8 0.8385 0.9167 0.7328 0.8247 18.02 GB 0 fa=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
10 c152_fav_bf16_fao_bf16_bndffn_q8 0.8385 0.9167 0.7328 0.8247 18.03 GB 0 fa_v=bf16, fa_o=bf16, boundary_ffn=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at bf16; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
11 c154_fao_bf16_faqk_q8_lateffn_q8 0.8385 0.9167 0.7328 0.8247 18.34 GB 0 fa_o=bf16, fa_qk=q8_0, late_ffn=q8_0 Mixed manual probe: full-attention output tensors at bf16; full-attention Q/K tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
12 c87_fa_q8_outtok_q8 0.8385 0.9167 0.7328 0.8247 18.43 GB 0 fa=q8_0, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
13 c96_fa_q8_lateffn_bf16 0.8385 0.9167 0.7328 0.8247 18.51 GB 0 fa=q8_0, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
14 c142_fa_q8_outtok_q8_lateffn_q8 0.8385 0.9167 0.7328 0.8247 18.79 GB 0 fa=q8_0, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
15 c126_bndall_bf16_lateffn_q8 0.8385 0.9167 0.7328 0.8247 19.05 GB 0 boundary_all=bf16, late_ffn=q8_0 Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at bf16; late FFN gate/down control tensors at q8_0.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
16 c05_all_fullattn_bf16 0.8385 0.9167 0.7328 0.8247 19.26 GB 0 ^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16 Full-attention ceiling probe: all full-attention Q/K/V/O tensors remain BF16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
17 c101_fa_bf16_ryssig_q8 0.8385 0.9167 0.7328 0.8247 19.60 GB 0 fa=bf16, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
18 c56_outtok_bf16 0.8385 0.9167 0.7328 0.8247 19.88 GB 0 output_token=bf16 Mixed manual probe: token embedding and output tensors at bf16.
rules
^output\.weight=bf16
^token_embd\.weight=bf16
19 c64_fa_q5k_outtok_bf16 0.8385 0.9167 0.7328 0.8247 20.11 GB 0 fa=q5_K, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=bf16
^token_embd\.weight=bf16
20 c99_fa_bf16_outtok_q8 0.8385 0.9167 0.7328 0.8247 20.20 GB 0 fa=bf16, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
21 c102_fa_bf16_ryssig_bf16 0.8385 0.9167 0.7328 0.8247 20.26 GB 0 fa=bf16, rys_signal=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS 15..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
22 c148_fa_bf16_outtok_q8_lateffn_q8 0.8385 0.9167 0.7328 0.8247 20.56 GB 0 fa=bf16, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
23 c147_fa_bf16_outtok_q8_bndffn_q8 0.8385 0.9167 0.7328 0.8247 20.73 GB 0 fa=bf16, output_token=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
24 c88_fa_q8_outtok_bf16 0.8385 0.9167 0.7328 0.8247 20.81 GB 0 fa=q8_0, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=bf16
^token_embd\.weight=bf16
25 c100_fa_bf16_outtok_bf16 0.8385 0.9167 0.7328 0.8247 22.58 GB 0 fa=bf16, output_token=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=bf16
^token_embd\.weight=bf16
26 c06_output_token_q8_fullattn_q8 0.8357 0.9167 0.7261 0.8214 18.43 GB 0 ^output\.weight=q8_0; ^token_embd\.weight=q8_0; ^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0 Protect expression/input surfaces plus full-attention routing: token/output Q8_0 and full-attention Q/K/V/O Q8_0.
rules
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
27 c106_fa_bf16_bndffn_bf16 0.8333 0.9167 0.7206 0.8186 20.79 GB 0 fa=bf16, boundary_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS entry, exit, and duplicate-boundary FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=bf16
28 c35_bndall_q6k 0.8243 0.9167 0.6994 0.8081 16.94 GB 0 boundary_all=q6_K Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at q6_K.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q6_K
29 c111_fav_bf16_fao_q8 0.8243 0.9167 0.6994 0.8081 16.96 GB 0 fa_v=bf16, fa_o=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=q8_0
30 c03_habit_boundary_q8 0.8243 0.9167 0.6994 0.8081 17.30 GB 0 ^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q8_0 Protect the RYS entry/exit boundaries and duplicated boundary layers with Q8_0 across attention, SSM, and FFN tensors.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q8_0
31 c45_lateattn_bf16 0.8243 0.9167 0.6994 0.8081 17.31 GB 0 late_attn=bf16 Mixed manual probe: late full-attention control tensors at bf16.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
32 c77_fa_q6k_ryssig_q8 0.8243 0.9167 0.6994 0.8081 17.38 GB 0 fa=q6_K, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
33 c83_fa_q6k_lateffn_q8 0.8243 0.9167 0.6994 0.8081 17.39 GB 0 fa=q6_K, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
34 c63_fa_q5k_outtok_q8 0.8243 0.9167 0.6994 0.8081 17.72 GB 0 fa=q5_K, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
35 c15_faqk_bf16 0.8243 0.9167 0.6994 0.8081 18.32 GB 0 fa_qk=bf16 Mixed manual probe: full-attention Q/K tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=bf16
36 c151_fav_bf16_fao_bf16_outtok_q8 0.8243 0.9167 0.6994 0.8081 18.44 GB 0 fa_v=bf16, fa_o=bf16, output_token=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention output tensors at bf16; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_output\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
37 c141_fa_q8_outtok_q8_bndffn_q8 0.8243 0.9167 0.6994 0.8081 18.97 GB 0 fa=q8_0, output_token=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
38 c105_fa_bf16_bndffn_q8 0.8243 0.9167 0.6994 0.8081 19.79 GB 0 fa=bf16, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
39 c145_fa_bf16_outtok_q8_ryssig_q8 0.8243 0.9167 0.6994 0.8081 20.55 GB 0 fa=bf16, output_token=q8_0, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
40 c01_rys_window_path_q8 0.8174 0.8750 0.7394 0.8072 17.11 GB 0 ^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0 Protect the original plus duplicated RYS 15..24 path on attention, linear-attention, and SSM tensors with Q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
41 c09_union_rys_path_q8_late_control_q8 0.8174 0.8750 0.7394 0.8072 17.72 GB 0 ^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0; ^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0; ^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0 Union of RYS-window path retention and late execution-control retention.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
42 c18_fav_q6k 0.8146 0.8750 0.7328 0.8039 16.57 GB 0 fa_v=q6_K Mixed manual probe: full-attention V tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=q6_K
43 c20_fav_bf16 0.8146 0.8750 0.7328 0.8039 16.68 GB 0 fa_v=bf16 Mixed manual probe: full-attention V tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
44 c27_srcsig_q8 0.8146 0.8750 0.7328 0.8039 16.73 GB 0 src_signal=q8_0 Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at q8_0.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
45 c46_lateffn_q6k 0.8146 0.8750 0.7328 0.8039 16.74 GB 0 late_ffn=q6_K Mixed manual probe: late FFN gate/down control tensors at q6_K.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q6_K
46 c61_earlyattn_q8 0.8146 0.8750 0.7328 0.8039 16.76 GB 0 early_attn=q8_0 Mixed manual probe: early full-attention routing tensors at q8_0.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=q8_0
47 c37_bndffn_q6k 0.8146 0.8750 0.7328 0.8039 16.83 GB 0 boundary_ffn=q6_K Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at q6_K.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q6_K
48 c31_srcpath_q8 0.8146 0.8750 0.7328 0.8039 16.83 GB 0 src_path=q8_0 Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at q8_0.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
49 c33_duppath_q8 0.8146 0.8750 0.7328 0.8039 16.83 GB 0 dup_path=q8_0 Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at q8_0.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
50 c57_out_q8 0.8146 0.8750 0.7328 0.8039 16.86 GB 0 output_only=q8_0 Mixed manual probe: output tensor at q8_0.
rules
^output\.weight=q8_0
51 c49_latectl_q6k 0.8146 0.8750 0.7328 0.8039 16.87 GB 0 late_control=q6_K Mixed manual probe: late attention plus FFN gate/down control tensors at q6_K.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q6_K
52 c13_faqk_q6k 0.8146 0.8750 0.7328 0.8039 16.87 GB 0 fa_qk=q6_K Mixed manual probe: full-attention Q/K tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q6_K
53 c54_outtok_q6k 0.8146 0.8750 0.7328 0.8039 16.88 GB 0 output_token=q6_K Mixed manual probe: token embedding and output tensors at q6_K.
rules
^output\.weight=q6_K
^token_embd\.weight=q6_K
54 c25_ryssig_q8 0.8146 0.8750 0.7328 0.8039 16.90 GB 0 rys_signal=q8_0 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
55 c47_lateffn_q8 0.8146 0.8750 0.7328 0.8039 16.91 GB 0 late_ffn=q8_0 Mixed manual probe: late FFN gate/down control tensors at q8_0.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
56 c52_finalffn_q8 0.8146 0.8750 0.7328 0.8039 16.96 GB 0 final_ffn=q8_0 Mixed manual probe: final FFN tensors at q8_0.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q8_0
57 c12_fa_q6k 0.8146 0.8750 0.7328 0.8039 17.03 GB 0 fa=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
58 c28_srcsig_bf16 0.8146 0.8750 0.7328 0.8039 17.05 GB 0 src_signal=bf16 Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at bf16.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
59 c65_fa_q5k_ryssig_q8 0.8146 0.8750 0.7328 0.8039 17.13 GB 0 fa=q5_K, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
60 c62_earlyattn_bf16 0.8146 0.8750 0.7328 0.8039 17.16 GB 0 early_attn=bf16 Mixed manual probe: early full-attention routing tensors at bf16.
rules
^blk\.(3|7|11|15)\.attn_(q|k|v|output)\.weight=bf16
61 c14_faqk_q8 0.8146 0.8750 0.7328 0.8039 17.17 GB 0 fa_qk=q8_0 Mixed manual probe: full-attention Q/K tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0
62 c73_fa_q5k_finalffn_q8 0.8146 0.8750 0.7328 0.8039 17.18 GB 0 fa=q5_K, final_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; final FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q8_0
63 c16_faqkv_q8 0.8146 0.8750 0.7328 0.8039 17.20 GB 0 fa_qkv=q8_0 Mixed manual probe: full-attention Q/K/V tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=q8_0
64 c116_srcsig_q8_dupsig_bf16 0.8146 0.8750 0.7328 0.8039 17.23 GB 0 src_signal=q8_0, dup_signal=bf16 Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at q8_0; duplicated-side RYS 20..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
65 c113_fav_bf16_faqk_q8 0.8146 0.8750 0.7328 0.8039 17.29 GB 0 fa_v=bf16, fa_qk=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention Q/K tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0
66 c32_srcpath_bf16 0.8146 0.8750 0.7328 0.8039 17.35 GB 0 src_path=bf16 Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at bf16.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
67 c85_fa_q6k_finalffn_q8 0.8146 0.8750 0.7328 0.8039 17.43 GB 0 fa=q6_K, final_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; final FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q8_0
68 c67_fa_q5k_rysffn_q6k 0.8146 0.8750 0.7328 0.8039 17.47 GB 0 fa=q5_K, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
69 c131_fa_q5k_ryssig_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 17.48 GB 0 fa=q5_K, rys_signal=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS 15..24 linear-attention and SSM signal path at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
70 c02_all_fullattn_q8 0.8146 0.8750 0.7328 0.8039 17.49 GB 0 ^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0 Protect all full-attention Q/K/V/O tensors with Q8_0, leaving recurrent and FFN tensors at IQ4_NL mix defaults.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
71 c55_outtok_q8 0.8146 0.8750 0.7328 0.8039 17.50 GB 0 output_token=q8_0 Mixed manual probe: token embedding and output tensors at q8_0.
rules
^output\.weight=q8_0
^token_embd\.weight=q8_0
72 c26_ryssig_bf16 0.8146 0.8750 0.7328 0.8039 17.55 GB 0 rys_signal=bf16 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
73 c81_fa_q6k_bndffn_q8 0.8146 0.8750 0.7328 0.8039 17.56 GB 0 fa=q6_K, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
74 c48_lateffn_bf16 0.8146 0.8750 0.7328 0.8039 17.58 GB 0 late_ffn=bf16 Mixed manual probe: late FFN gate/down control tensors at bf16.
rules
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
75 c156_srcsig_bf16_dupsig_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 17.58 GB 0 src_signal=bf16, dup_signal=q8_0, late_ffn=q8_0 Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at bf16; duplicated-side RYS 20..24 linear-attention and SSM signal path at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
76 c117_srcpath_bf16_duppath_q8 0.8146 0.8750 0.7328 0.8039 17.63 GB 0 src_path=bf16, dup_path=q8_0 Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at bf16; duplicated-side RYS 20..24 attention, linear-attention, and SSM path at q8_0.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
77 c118_srcpath_q8_duppath_bf16 0.8146 0.8750 0.7328 0.8039 17.63 GB 0 src_path=q8_0, dup_path=bf16 Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at q8_0; duplicated-side RYS 20..24 attention, linear-attention, and SSM path at bf16.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
78 c125_bndall_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 17.65 GB 0 boundary_all=q8_0, late_ffn=q8_0 Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
79 c121_lateattn_bf16_lateffn_q8 0.8146 0.8750 0.7328 0.8039 17.66 GB 0 late_attn=bf16, late_ffn=q8_0 Mixed manual probe: late full-attention control tensors at bf16; late FFN gate/down control tensors at q8_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
80 c53_finalffn_bf16 0.8146 0.8750 0.7328 0.8039 17.71 GB 0 final_ffn=bf16 Mixed manual probe: final FFN tensors at bf16.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=bf16
81 c79_fa_q6k_rysffn_q6k 0.8146 0.8750 0.7328 0.8039 17.72 GB 0 fa=q6_K, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
82 c137_fa_q6k_ryssig_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 17.73 GB 0 fa=q6_K, rys_signal=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS 15..24 linear-attention and SSM signal path at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
83 c72_fa_q5k_lateffn_bf16 0.8146 0.8750 0.7328 0.8039 17.80 GB 0 fa=q5_K, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
84 c122_lateattn_q8_lateffn_bf16 0.8146 0.8750 0.7328 0.8039 17.84 GB 0 late_attn=q8_0, late_ffn=bf16 Mixed manual probe: late full-attention control tensors at q8_0; late FFN gate/down control tensors at bf16.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
85 c95_fa_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 17.84 GB 0 fa=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
86 c97_fa_q8_finalffn_q8 0.8146 0.8750 0.7328 0.8039 17.89 GB 0 fa=q8_0, final_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; final FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q8_0
87 c41_rysffn_q8 0.8146 0.8750 0.7328 0.8039 17.89 GB 0 rys_ffn=q8_0 Mixed manual probe: RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
88 c138_fa_q6k_bndffn_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 17.92 GB 0 fa=q6_K, boundary_ffn=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
89 c74_fa_q5k_finalffn_bf16 0.8146 0.8750 0.7328 0.8039 17.93 GB 0 fa=q5_K, final_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; final FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=bf16
90 c75_fa_q6k_outtok_q8 0.8146 0.8750 0.7328 0.8039 17.97 GB 0 fa=q6_K, output_token=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
91 c78_fa_q6k_ryssig_bf16 0.8146 0.8750 0.7328 0.8039 18.03 GB 0 fa=q6_K, rys_signal=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS 15..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
92 c84_fa_q6k_lateffn_bf16 0.8146 0.8750 0.7328 0.8039 18.05 GB 0 fa=q6_K, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
93 c58_out_bf16 0.8146 0.8750 0.7328 0.8039 18.05 GB 0 output_only=bf16 Mixed manual probe: output tensor at bf16.
rules
^output\.weight=bf16
94 c127_fa_q5k_outtok_q8_ryssig_q8 0.8146 0.8750 0.7328 0.8039 18.07 GB 0 fa=q5_K, output_token=q8_0, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
95 c39_bndffn_bf16 0.8146 0.8750 0.7328 0.8039 18.09 GB 0 boundary_ffn=bf16 Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at bf16.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=bf16
96 c123_latectl_q8_outtok_q8 0.8146 0.8750 0.7328 0.8039 18.11 GB 0 late_control=q8_0, output_token=q8_0 Mixed manual probe: late attention plus FFN gate/down control tensors at q8_0; token embedding and output tensors at q8_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
97 c68_fa_q5k_rysffn_q8 0.8146 0.8750 0.7328 0.8039 18.12 GB 0 fa=q5_K, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
98 c04_rys_window_path_bf16 0.8146 0.8750 0.7328 0.8039 18.15 GB 0 ^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16 Upper probe for habit retention: keep the RYS 15..24 attention/linear/SSM path at BF16.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
99 c91_fa_q8_rysffn_q6k 0.8146 0.8750 0.7328 0.8039 18.18 GB 0 fa=q8_0, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
100 c143_fa_q8_ryssig_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 18.19 GB 0 fa=q8_0, rys_signal=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
101 c153_fav_bf16_faqk_q8_outtok_q8 0.8146 0.8750 0.7328 0.8039 18.24 GB 0 fa_v=bf16, fa_qk=q8_0, output_token=q8_0 Mixed manual probe: full-attention V tensors at bf16; full-attention Q/K tensors at q8_0; token embedding and output tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=bf16
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
102 c136_fa_q6k_outtok_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 18.33 GB 0 fa=q6_K, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
103 c144_fa_q8_bndffn_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 18.38 GB 0 fa=q8_0, boundary_ffn=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
104 c17_faqkv_bf16 0.8146 0.8750 0.7328 0.8039 18.44 GB 0 fa_qkv=bf16 Mixed manual probe: full-attention Q/K/V tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v)\.weight=bf16
105 c07_rys_window_all_q8 0.8146 0.8750 0.7328 0.8039 18.45 GB 0 ^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q8_0 Broad local recovery: the full RYS 15..24 window gets Q8_0 on attention, SSM, and FFN tensors.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=q8_0
106 c90_fa_q8_ryssig_bf16 0.8146 0.8750 0.7328 0.8039 18.49 GB 0 fa=q8_0, rys_signal=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
107 c135_fa_q6k_outtok_q8_bndffn_q8 0.8146 0.8750 0.7328 0.8039 18.51 GB 0 fa=q6_K, output_token=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; token embedding and output tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
108 c82_fa_q6k_bndffn_bf16 0.8146 0.8750 0.7328 0.8039 18.57 GB 0 fa=q6_K, boundary_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS entry, exit, and duplicate-boundary FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=bf16
109 c98_fa_q8_finalffn_bf16 0.8146 0.8750 0.7328 0.8039 18.64 GB 0 fa=q8_0, final_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; final FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=bf16
110 c36_bndall_bf16 0.8146 0.8750 0.7328 0.8039 18.69 GB 0 boundary_all=bf16 Mixed manual probe: RYS entry, exit, and duplicate-boundary attention/SSM/FFN tensors at bf16.
rules
^blk\.(15|19|20|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta)|ffn_(down|gate|up))\.weight=bf16
111 c10_union_rys_path_bf16_late_control_q8 0.8146 0.8750 0.7328 0.8039 18.77 GB 0 ^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16; ^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0; ^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0 Heavier union: RYS 15..24 pathway BF16 with late execution-control Q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
112 c92_fa_q8_rysffn_q8 0.8146 0.8750 0.7328 0.8039 18.82 GB 0 fa=q8_0, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
113 c94_fa_q8_bndffn_bf16 0.8146 0.8750 0.7328 0.8039 19.02 GB 0 fa=q8_0, boundary_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=bf16
114 c128_fa_q5k_outtok_q8_rysffn_q8 0.8146 0.8750 0.7328 0.8039 19.06 GB 0 fa=q5_K, output_token=q8_0, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at q8_0; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
115 c134_fa_q6k_outtok_q8_rysffn_q8 0.8146 0.8750 0.7328 0.8039 19.31 GB 0 fa=q6_K, output_token=q8_0, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; token embedding and output tensors at q8_0; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
116 c107_fa_bf16_lateffn_q8 0.8146 0.8750 0.7328 0.8039 19.61 GB 0 fa=bf16, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
117 c109_fa_bf16_finalffn_q8 0.8146 0.8750 0.7328 0.8039 19.66 GB 0 fa=bf16, final_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; final FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q8_0
118 c103_fa_bf16_rysffn_q6k 0.8146 0.8750 0.7328 0.8039 19.95 GB 0 fa=bf16, rys_ffn=q6_K Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
119 c150_fa_bf16_bndffn_q8_lateffn_q8 0.8146 0.8750 0.7328 0.8039 20.15 GB 0 fa=bf16, boundary_ffn=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
120 c108_fa_bf16_lateffn_bf16 0.8146 0.8750 0.7328 0.8039 20.28 GB 0 fa=bf16, late_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; late FFN gate/down control tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
121 c42_rysffn_bf16 0.8146 0.8750 0.7328 0.8039 20.40 GB 0 rys_ffn=bf16 Mixed manual probe: RYS 15..24 FFN tensors at bf16.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=bf16
122 c110_fa_bf16_finalffn_bf16 0.8146 0.8750 0.7328 0.8039 20.41 GB 0 fa=bf16, final_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; final FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=bf16
123 c104_fa_bf16_rysffn_q8 0.8146 0.8750 0.7328 0.8039 20.59 GB 0 fa=bf16, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
124 c146_fa_bf16_outtok_q8_rysffn_q8 0.8146 0.8750 0.7328 0.8039 21.54 GB 0 fa=bf16, output_token=q8_0, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; token embedding and output tensors at q8_0; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
125 c51_finalffn_q6k 0.8117 0.8750 0.7261 0.8006 16.76 GB 0 final_ffn=q6_K Mixed manual probe: final FFN tensors at q6_K.
rules
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=q6_K
126 c86_fa_q6k_finalffn_bf16 0.8032 0.8750 0.7061 0.7906 18.18 GB 0 fa=q6_K, final_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; final FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(60|64|68)\.ffn_(down|gate|up)\.weight=bf16
127 c140_fa_q8_outtok_q8_rysffn_q8 0.8032 0.8750 0.7061 0.7906 19.77 GB 0 fa=q8_0, output_token=q8_0, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
128 c40_rysffn_q6k 0.8031 0.8550 0.7328 0.7939 17.24 GB 0 rys_ffn=q6_K Mixed manual probe: RYS 15..24 FFN tensors at q6_K.
rules
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q6_K
129 c19_fav_q8 0.8004 0.8750 0.6994 0.7872 16.59 GB 0 fa_v=q8_0 Mixed manual probe: full-attention V tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_v\.weight=q8_0
130 c43_lateattn_q6k 0.8004 0.8750 0.6994 0.7872 16.69 GB 0 late_attn=q6_K Mixed manual probe: late full-attention control tensors at q6_K.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
131 c29_dupsig_q8 0.8004 0.8750 0.6994 0.7872 16.73 GB 0 dup_signal=q8_0 Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
132 c24_ryssig_q6k 0.8004 0.8750 0.6994 0.7872 16.73 GB 0 rys_signal=q6_K Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q6_K.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q6_K
133 c11_fa_q5k 0.8004 0.8750 0.6994 0.7872 16.78 GB 0 fa=q5_K Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
134 c44_lateattn_q8 0.8004 0.8750 0.6994 0.7872 16.81 GB 0 late_attn=q8_0 Mixed manual probe: late full-attention control tensors at q8_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
135 c30_dupsig_bf16 0.8004 0.8750 0.6994 0.7872 17.05 GB 0 dup_signal=bf16 Mixed manual probe: duplicated-side RYS 20..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
136 c38_bndffn_q8 0.8004 0.8750 0.6994 0.7872 17.09 GB 0 boundary_ffn=q8_0 Mixed manual probe: RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
137 c71_fa_q5k_lateffn_q8 0.8004 0.8750 0.6994 0.7872 17.14 GB 0 fa=q5_K, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
138 c08_late_control_q8 0.8004 0.8750 0.6994 0.7872 17.17 GB 0 ^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0; ^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0 Probe late execution-control retention: final five full-attention blocks plus late FFN gate/down tensors at Q8_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
139 c59_tok_q8 0.8004 0.8750 0.6994 0.7872 17.19 GB 0 token_only=q8_0 Mixed manual probe: token embedding tensor at q8_0.
rules
^token_embd\.weight=q8_0
140 c115_srcsig_bf16_dupsig_q8 0.8004 0.8750 0.6994 0.7872 17.23 GB 0 src_signal=bf16, dup_signal=q8_0 Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at bf16; duplicated-side RYS 20..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
141 c34_duppath_bf16 0.8004 0.8750 0.6994 0.7872 17.35 GB 0 dup_path=bf16 Mixed manual probe: duplicated-side RYS 20..24 attention, linear-attention, and SSM path at bf16.
rules
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
142 c132_fa_q5k_bndffn_q8_lateffn_q8 0.8004 0.8750 0.6994 0.7872 17.67 GB 0 fa=q5_K, boundary_ffn=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
143 c66_fa_q5k_ryssig_bf16 0.8004 0.8750 0.6994 0.7872 17.78 GB 0 fa=q5_K, rys_signal=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS 15..24 linear-attention and SSM signal path at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
144 c130_fa_q5k_outtok_q8_lateffn_q8 0.8004 0.8750 0.6994 0.7872 18.08 GB 0 fa=q5_K, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
145 c155_srcsig_bf16_dupsig_q8_outtok_q8 0.8004 0.8750 0.6994 0.7872 18.17 GB 0 src_signal=bf16, dup_signal=q8_0, output_token=q8_0 Mixed manual probe: source-side RYS 15..19 linear-attention and SSM signal path at bf16; duplicated-side RYS 20..24 linear-attention and SSM signal path at q8_0; token embedding and output tensors at q8_0.
rules
^blk\.(15|16|17|18|19)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(20|21|22|23|24)\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
146 c129_fa_q5k_outtok_q8_bndffn_q8 0.8004 0.8750 0.6994 0.7872 18.26 GB 0 fa=q5_K, output_token=q8_0, boundary_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; token embedding and output tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
147 c133_fa_q6k_outtok_q8_ryssig_q8 0.8004 0.8750 0.6994 0.7872 18.32 GB 0 fa=q6_K, output_token=q8_0, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; token embedding and output tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
148 c50_latectl_bf16 0.8004 0.8750 0.6994 0.7872 18.33 GB 0 late_control=bf16 Mixed manual probe: late attention plus FFN gate/down control tensors at bf16.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
149 c80_fa_q6k_rysffn_q8 0.8004 0.8750 0.6994 0.7872 18.37 GB 0 fa=q6_K, rys_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q6_K; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q6_K
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
150 c60_tok_bf16 0.8004 0.8750 0.6994 0.7872 18.38 GB 0 token_only=bf16 Mixed manual probe: token embedding tensor at bf16.
rules
^token_embd\.weight=bf16
151 c160_ryspath_q8_outtok_q8_lateffn_q8 0.8004 0.8750 0.6994 0.7872 18.41 GB 0 rys_path=q8_0, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at q8_0; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
152 c157_srcpath_bf16_duppath_q8_outtok_q8 0.8004 0.8750 0.6994 0.7872 18.58 GB 0 src_path=bf16, dup_path=q8_0, output_token=q8_0 Mixed manual probe: source-side RYS 15..19 attention, linear-attention, and SSM path at bf16; duplicated-side RYS 20..24 attention, linear-attention, and SSM path at q8_0; token embedding and output tensors at q8_0.
rules
^blk\.(15|16|17|18|19)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(20|21|22|23|24)\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
153 c161_lateattn_bf16_lateffn_q8_outtok_q8 0.8004 0.8750 0.6994 0.7872 18.61 GB 0 late_attn=bf16, late_ffn=q8_0, output_token=q8_0 Mixed manual probe: late full-attention control tensors at bf16; late FFN gate/down control tensors at q8_0; token embedding and output tensors at q8_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
154 c162_latectl_q8_outtok_q8_bndffn_q8 0.8004 0.8750 0.6994 0.7872 18.65 GB 0 late_control=q8_0, output_token=q8_0, boundary_ffn=q8_0 Mixed manual probe: late attention plus FFN gate/down control tensors at q8_0; token embedding and output tensors at q8_0; RYS entry, exit, and duplicate-boundary FFN tensors at q8_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=q8_0
155 c139_fa_q8_outtok_q8_ryssig_q8 0.8004 0.8750 0.6994 0.7872 18.78 GB 0 fa=q8_0, output_token=q8_0, rys_signal=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at q8_0; token embedding and output tensors at q8_0; RYS 15..24 linear-attention and SSM signal path at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q8_0
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
156 c120_ryssig_bf16_rysffn_q8 0.8004 0.8750 0.6994 0.7872 18.89 GB 0 rys_signal=bf16, rys_ffn=q8_0 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at bf16; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
157 c124_latectl_bf16_outtok_q8 0.8004 0.8750 0.6994 0.7872 19.27 GB 0 late_control=bf16, output_token=q8_0 Mixed manual probe: late attention plus FFN gate/down control tensors at bf16; token embedding and output tensors at q8_0.
rules
^blk\.(52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
158 c159_ryspath_bf16_outtok_q8_lateffn_q8 0.8004 0.8750 0.6994 0.7872 19.45 GB 0 rys_path=bf16, output_token=q8_0, late_ffn=q8_0 Mixed manual probe: RYS 15..24 attention, linear-attention, and SSM path at bf16; token embedding and output tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_(q|k|v|output)|attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^output\.weight=q8_0
^token_embd\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
159 c149_fa_bf16_ryssig_q8_lateffn_q8 0.8004 0.8750 0.6994 0.7872 19.96 GB 0 fa=bf16, rys_signal=q8_0, late_ffn=q8_0 Mixed manual probe: all full-attention Q/K/V/O tensors at bf16; RYS 15..24 linear-attention and SSM signal path at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
160 release_s010_iq4nl 0.7954 0.8417 0.7328 0.7872 16.55 GB 0
161 c70_fa_q5k_bndffn_bf16 0.7952 0.8750 0.6872 0.7811 18.32 GB 0 fa=q5_K, boundary_ffn=bf16 Mixed manual probe: all full-attention Q/K/V/O tensors at q5_K; RYS entry, exit, and duplicate-boundary FFN tensors at bf16.
rules
^blk\.(3|7|11|15|19|20|24|28|32|36|40|44|48|52|56|60|64|68)\.attn_(q|k|v|output)\.weight=q5_K
^blk\.(15|19|20|24)\.ffn_(down|gate|up)\.weight=bf16
162 c158_ryssig_bf16_rysffn_q8_lateffn_q8 0.7906 0.8333 0.7328 0.7831 19.25 GB 0 rys_signal=bf16, rys_ffn=q8_0, late_ffn=q8_0 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at bf16; RYS 15..24 FFN tensors at q8_0; late FFN gate/down control tensors at q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=bf16
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
^blk\.(56|60|64|68)\.ffn_(down|gate)\.weight=q8_0
163 c119_ryssig_q8_rysffn_q8 0.7889 0.8550 0.6994 0.7772 18.24 GB 0 rys_signal=q8_0, rys_ffn=q8_0 Mixed manual probe: RYS 15..24 linear-attention and SSM signal path at q8_0; RYS 15..24 FFN tensors at q8_0.
rules
^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0
^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0
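Reading the rules lines: each line under "rules" appears to pair a tensor-name regular expression with a quant type, separated by the final "=". The sketch below shows one way to check which tensors a recipe would touch, using the two rule lines from row 163 (c119_ryssig_q8_rysffn_q8). It is an illustration only: the tensor names, the "base" fallback label, the helper names, and the first-match-wins ordering are assumptions, not confirmed behavior of the quantization tooling.

```python
import re

# Rule lines copied verbatim from row 163 (c119_ryssig_q8_rysffn_q8).
RULES = [
    r"^blk\.(1[5-9]|2[0-4])\.(attn_qkv|attn_gate|ssm_(out|alpha|beta))\.weight=q8_0",
    r"^blk\.(1[5-9]|2[0-4])\.ffn_(down|gate|up)\.weight=q8_0",
]


def parse_rule(line):
    """Split a 'regex=type' line on its last '=' into (compiled regex, type)."""
    pattern, qtype = line.rsplit("=", 1)
    return re.compile(pattern), qtype


def assign_type(tensor_name, rules, default="base"):
    """Return the type of the first matching rule.

    First-match-wins and the 'base' fallback label are assumptions made
    for this sketch.
    """
    for regex, qtype in rules:
        if regex.search(tensor_name):
            return qtype
    return default


parsed = [parse_rule(line) for line in RULES]

# Hypothetical tensor names, chosen only to exercise the patterns.
for name in [
    "blk.15.ffn_down.weight",   # inside 15..24, FFN tensor
    "blk.24.ssm_out.weight",    # inside 15..24, SSM signal path
    "blk.25.ffn_up.weight",     # just outside the 15..24 window
    "blk.14.attn_qkv.weight",   # just outside the 15..24 window
]:
    print(name, "->", assign_type(name, parsed))
```

The block range `(1[5-9]|2[0-4])` covers blocks 15 through 24 inclusive, so the last two names fall back to the default label.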

Source accounting reproduced on page

Dataset | Rows | What it represents
Stable promotion summary | 163 | Candidate rows from the stable 12-task hard-suite promotion screen.
Comprehensive summary | 440 | Later comprehensive candidate rows used for broader analysis and candidate-family comparison.
KritaLite | 11 | Real-world browser paint application harness rows.
SyncBoard | 8 | Real-world local-first board/state-management harness rows.
Legacy-codebase migration | 13 | Real-world existing-codebase memory migration harness rows.
Candidate catalog | 882 | Generated candidate recipe definitions used to annotate tested rows. Untested definitions are not counted as results.
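
As a quick consistency check on the accounting above, the three real-world harness counts should add up to the 32 real-world ClawCode run rows quoted at the top of the page, and the last rank in the stable table (163) should equal the stable row count. The sketch below only restates numbers already printed on this page; the variable names are illustrative.

```python
# Row counts copied from the source-accounting table on this page.
real_world_rows = {
    "KritaLite": 11,
    "SyncBoard": 8,
    "Legacy-codebase migration": 13,
}
stable_rows = 163          # stable promotion summary
last_stable_rank = 163     # rank of the final row in the stable table
headline_real_world = 32   # "real-world ClawCode run rows" from the page header

real_world_total = sum(real_world_rows.values())

print("real-world total:", real_world_total)
print("matches headline:", real_world_total == headline_real_world)
print("stable ranks match row count:", last_stable_rank == stable_rows)
```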