trueskill-tt

Author	SHA1	Message	Date
Anders Olsson	244b94a3e5	feat(api): typed add_events(iter); generify internal path over T Public API gains: History::add_events<I: IntoIterator<Item = Event<T, K>>>(events) -> Result<(), InferenceError> which accepts the typed Event<T, K> shape added in Task 10. Ranks from Outcome::Ranked are mapped to the legacy "higher f64 = better" results internally. add_events_with_prior now takes Vec<T> for times (was Vec<i64>), generifying the whole internal path over T in a single fully-generic impl<T: Time, D: Drift<T>, O: Observer<T>, K> block. The i64-specific block is gone; record_winner/record_draw are now generic over T. add_events_with_prior stays pub (not pub(crate)) because the ATP example calls it directly with pre-built Index-based composition; the new typed add_events is the primary public API going forward. In-crate tests updated to call add_events_with_prior with an empty HashMap. tests/api_shape.rs added with 3 integration tests covering bulk ingest, draw, and mismatched-outcome error. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 12:39:46 +02:00
Anders Olsson	044fb83a38	feat(api): add record_winner, record_draw, intern, lookup on History Spec Section 4 "three-tier event ingestion" tier 2: one-off match convenience. Spec open question 3: expose Index + intern/lookup for power users. History and HistoryBuilder gain a 4th generic parameter K: Eq + Hash + Clone = &'static str. The default ensures existing tests using Index-based add_events compile unchanged. History internally owns a KeyTable<K>. intern(&Q) creates or returns an Index for the given key; lookup(&Q) returns Option<Index> without creating. record_winner and record_draw are thin 1v1 wrappers around the internal add_events_with_prior. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 12:30:04 +02:00
Anders Olsson	a83c9acacb	feat(error): expand InferenceError; convert boundary asserts to Result InferenceError gains MismatchedShape (user-input length mismatches), InvalidProbability (p_draw out of [0, 1]), and ConvergenceFailed (exceeded max_iter without hitting epsilon). NegativePrecision stays. History::add_events_with_prior and History::add_events now return Result<(), InferenceError>. The previous assert! macros checking composition/results/times/weights shape are replaced by matched error returns. Internal debug_assert! macros for arithmetic invariants stay; this change only affects boundary validation of user input. Tests updated to call .unwrap() on the Result. The old signatures will be fully replaced in Task 15 (typed add_events(iter)) and the nested-Vec wrapper removed in Task 20. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 12:26:13 +02:00
Anders Olsson	a6e008f8ff	feat(api): add ConvergenceOptions, ConvergenceReport, History::converge New public types: - ConvergenceOptions { max_iter, epsilon } — config for the loop - ConvergenceReport { iterations, final_step, log_evidence, converged, per_iteration_time, slices_skipped } — post-hoc summary History and HistoryBuilder gain a third generic parameter O: Observer<T> = NullObserver. Builder methods: - .convergence(opts) sets the ConvergenceOptions - .observer(o) plugs in an Observer (reshapes the builder's O param) History::converge() runs the existing iteration loop driven by the stored opts, emits observer callbacks on each iteration end and on completion, and returns Result<ConvergenceReport, InferenceError>. The old convergence(iters, eps, verbose) stays — gets removed in Task 20 after tests are translated. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 12:20:24 +02:00
Anders Olsson	726896a2ba	feat(api): add Observer trait and NullObserver default Observer replaces verbose: bool with structured progress callbacks: on_iteration_end, on_batch_processed, on_converged — all no-op default impls so users override only what they need. NullObserver is a ZST default. Send + Sync bounds deferred to T3 (Rayon support). Fully additive — wired into History::converge in Task 12. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 12:16:25 +02:00
Anders Olsson	f5a486329e	feat(api): add Event<T, K>, Team<K>, Member<K> typed event description Replaces the old nested Vec<Vec<Vec<_>>> event description on the public API boundary. Member<K>::from(K) enables ergonomic literal lists. Member::with_weight / with_prior are builder methods for the optional per-event overrides. Fully additive — no existing call sites updated. Consumed by History::add_events(iter) in Task 15. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 12:14:58 +02:00
Anders Olsson	3df422db78	feat(api): add Outcome enum with Ranked variant Outcome::winner(i, n), Outcome::draw(n), Outcome::ranking(iter) are the convenience constructors. Marked #[non_exhaustive] so Scored can be added in T4 without breaking match exhaustiveness. Adds smallvec = "1" as a direct dependency. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 12:12:53 +02:00
Anders Olsson	33a7d90b89	refactor(history): remove time: bool; translate tests to explicit timestamps The bool encoded 'no time axis' which is now expressed at the type level (T = Untimed). The old !self.time branch generated sequential i64 timestamps internally (1..=n) and bumped all agents' last_time at every tick; tests that relied on this now pass those timestamps explicitly and reflect the correct time=true elapsed semantics. Collapsed `if self.time { A } else { B }` into the A branch everywhere in add_events_with_prior. Removed the two !self.time blocks that updated all agents' last_time at every slice regardless of participation. sort_time is now generic over `T: Copy + Ord`. HistoryBuilder::time(bool) removed. History<i64, ConstantDrift> default remains, producing the same behavior as old .time(true). The test_env_ttt Gaussian goldens are updated to reflect the correct time=true semantics (b.elapsed=2 instead of 1 due to b skipping t=2); this is a correction: the old !self.time last_time bump was an implementation quirk that diverged from the Python reference. 55 tests pass. clippy clean. fmt clean. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 12:09:23 +02:00
Anders Olsson	59e4cb35cc	refactor(api): generify Drift, Rating, Competitor, TimeSlice, CompetitorStore, History over T: Time Drift now takes &T -> &T and is generic over the time axis. Untimed impls return elapsed=0. ConstantDrift impl covers all T via the Time trait. An additional variance_for_elapsed(i64) method on the trait serves callers that work with the pre-cached i64 elapsed count. Competitor.last_time moves from i64 with MIN sentinel to Option<T> with None sentinel. receive(&T) computes variance from last_time dynamically; receive_for_elapsed(i64) uses a pre-cached elapsed count (needed in convergence sweeps where last_time has already advanced). TimeSlice.time changes from i64 to T. compute_elapsed is now generic over T and takes Option<&T> for the last-seen time. new_forward_info uses receive_for_elapsed to preserve the cached elapsed during sweeps. History<D> becomes History<T, D>; HistoryBuilder<D> becomes HistoryBuilder<T, D>; Game<D> becomes Game<T, D>. Defaults keep existing call sites compiling with zero changes: T = i64, D = ConstantDrift. add_events / add_events_with_prior stay on impl History<i64, D> since times: Vec<i64> is i64-specific (Task 8 will generalise this). In !self.time mode the old i64::MAX sentinel guaranteed elapsed=1 for every slice transition regardless of time gaps. Replaced by advancing all previously-seen agents' last_time to Some(current_slice_time) at the end of each slice; this preserves elapsed=1 between adjacent slices in sequential-integer untimed mode. The time: bool field on History and .time(bool) on HistoryBuilder are NOT removed by this task — deferred to Task 8 so this commit is purely a type-level generification. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 11:50:35 +02:00
Anders Olsson	a285c1a0f2	feat(api): add Time trait with Untimed and i64 impls Foundation for generic History time axis. Untimed is the ZST case (no drift across slices); i64 is the standard timestamp case. Additional impls (time::OffsetDateTime, chrono) can be added behind feature flags in follow-up work. The trait is not yet wired into History — that happens in Task 7 along with generifying Drift over T. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 11:32:38 +02:00
Anders Olsson	5e752f9e98	refactor(api): rename Batch to TimeSlice TimeSlice says what it is: every event sharing one timestamp. The History field .batches is renamed to .time_slices. Local variables named `batch` referring to TimeSlice instances are renamed to `time_slice`. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 10:54:31 +02:00
Anders Olsson	decbd895a3	refactor(api): rename Agent to Competitor and .player field to .rating Competitor holds dynamic per-history state (message, last_time) for someone competing; its configuration lives in a Rating. AgentStore renamed to CompetitorStore to match. The internal `clean()` free function's parameter name changed from `agents` to `competitors` for consistency. Local variable names (agent_idx, this_agent) inside history.rs are left unchanged — they represent abstract identifiers, not Competitor instances. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 10:48:50 +02:00
Anders Olsson	88d54cb9f4	docs(factor): update stale Player reference to Rating Follow-up to the Player→Rating rename (`2f5aa98`); a doc comment in team_sum.rs still referenced Player::performance().	2026-04-24 10:44:26 +02:00
Anders Olsson	2f5aa98eac	refactor(api): rename Player to Rating The struct holds prior/beta/drift — a rating configuration, not a person. The person-with-temporal-state is the Competitor (renamed in the next task). Resolves Player/Agent ambiguity. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:43:19 +02:00
Anders Olsson	52f5f76a34	refactor(lib): make key_table module private; revert bench var rename Address code review feedback from Task 2: - key_table module doesn't need pub visibility; the KeyTable re-export at lib.rs root already exposes the only public type. Matches the error/history private-module pattern. - Revert an incidental bench variable rename (index_map → index) that wasn't part of the task scope. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:38:22 +02:00
Anders Olsson	c69fe4e67c	refactor(api): rename IndexMap to KeyTable The former name collided with the popular indexmap crate. KeyTable lives in its own module. Public API unchanged beyond the rename. Part of T2 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 10:34:14 +02:00
Anders Olsson	948a7a684b	docs: add T2 new-API-surface implementation plan 21-task plan covering all renames and new public API landing per Section 7 "T2" of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 10:31:33 +02:00
Anders Olsson	6437649436	perf(arena): pool team_prior/lhood/inv buffers to eliminate per-game allocs Move team_prior, lhood_lose, lhood_win, inv_buf into ScratchArena so their Vec capacity is reused across games in a Batch. Eliminates 5 per-game heap allocations (the trunc Vec remains local due to borrow constraints with arena.vars). Batch::iteration: 23.0 µs (down from 27.0 µs with naive local Vecs; 8% above T0 21.253 µs baseline due to TruncFactor propagate overhead).	2026-04-24 09:10:48 +02:00
Anders Olsson	cdfd75f846	bench: capture T1 final numbers and fix clippy warnings Fixed: - Removed unused .enumerate() in batch.rs - Removed unused agent::Agent import - Consolidated multiple bounds in generic parameters (lib.rs) - Suppressed dead_code for test-only code with #[allow(dead_code)] - Fixed unused imports and neg-multiply lint Batch::iteration: 27.023 µs (T0 was 21.253 µs, expected minor regression from T1 infrastructure). Gaussian::* unchanged (~236-280 ps). Acceptance: T1 factor-graph refactor lands without clippy/fmt issues. All 53 tests pass. Closes T1 tier.	2026-04-24 09:04:29 +02:00
Anders Olsson	c02d5ca0ab	perf(game): replace order.clone()+position() with inverse permutation	2026-04-24 08:58:09 +02:00
Anders Olsson	cdee7b2b99	fix(arena): remove unused Gaussian import in test module	2026-04-24 08:52:11 +02:00
Anders Olsson	cb07a874e8	refactor(game): rebuild Game::likelihoods on factor-graph machinery Game::likelihoods now uses VarStore (for diff vars) and TruncFactor (for EP truncation + evidence caching) instead of TeamMessage and DiffMessage. The EP loop structure is preserved exactly; VarId-keyed diff vars live in the arena's VarStore (capacity reused per batch). ScratchArena loses teams/diffs/ties/margins; gains VarStore and sort_buf (sort_perm allocation eliminated). message.rs deleted. Public API of Game (new, posteriors, likelihoods, evidence) unchanged.	2026-04-24 08:51:18 +02:00
Anders Olsson	da69f02ff7	feat(schedule): add Schedule trait and EpsilonOrMax impl EpsilonOrMax mirrors today's Game::likelihoods loop: sweep forward then backward over iterating factors, capped at 10 iterations or step <= 1e-6. Setup factors (TeamSum) run exactly once before the loop begins. ScheduleReport is the only public surface from this module.	2026-04-24 08:25:13 +02:00
Anders Olsson	54e46bef59	feat(factor): implement TruncFactor with cached evidence EP truncation factor that operates on a diff variable. Stores its outgoing message so the cavity computation produces the correct EP message on each propagation. The first propagation caches the evidence contribution (cdf-bounded probability) for log_evidence(). Promotes lib::cdf to pub(crate) so the factor can use it.	2026-04-24 08:22:06 +02:00
Anders Olsson	ae141752b7	feat(factor): implement RankDiffFactor Maintains diff = team_a - team_b across three variables. On each propagation, reads the team-perf marginals (which may have been updated by neighboring factors) and computes the new diff via Gaussian Sub (variance addition).	2026-04-24 08:19:18 +02:00
Anders Olsson	1210a34a64	fix(factor): move N_INF import to test module in team_sum	2026-04-24 08:17:54 +02:00
Anders Olsson	cee70c6272	feat(factor): implement TeamSumFactor Computes the weighted sum of player performance Gaussians into a team-performance variable. Runs once per game (no iteration needed). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 08:17:14 +02:00
Anders Olsson	ebccc7b454	feat(factor): introduce Factor trait and BuiltinFactor enum Adds the trait that all factors implement and the enum dispatcher used by the schedule to drive heterogeneous factors without dynamic dispatch in the hot loop. The three built-in factors (TeamSum, RankDiff, Trunc) are stubbed out; concrete implementations follow in tasks 4-6.	2026-04-24 08:14:00 +02:00
Anders Olsson	dac4427b65	feat(factor): introduce VarId and VarStore Foundation types for the T1 factor graph machinery. VarStore is a flat Vec<Gaussian> indexed by VarId; variables are allocated by alloc() and the store can be cleared between games to reuse capacity. Part of T1 of docs/superpowers/specs/2026-04-23-trueskill-engine-redesign-design.md.	2026-04-24 08:09:25 +02:00
Anders Olsson	fa85bcee51	docs: add T1 factor-graph implementation plan Bite-sized, TDD-style task breakdown for the second tier of the engine redesign: introduce VarStore, Factor trait, BuiltinFactor enum, and EpsilonOrMax schedule, then re-implement Game::likelihoods on top of the new machinery. Internal-only refactor; public Game/History API unchanged. Acceptance: existing tests pass within ULP, iteration counts match T0, no Batch::iteration regression vs T0 (~21.5 µs). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-24 07:42:33 +02:00
Anders Olsson	d3cfee53a1	bench: capture T0 final numbers and post-mortem Batch::iteration: 29.840 µs → 21.253 µs (1.40×) Gaussian::mul: 1.568 ns → 218.69 ps (7.17×) Gaussian::div: 1.572 ns → 218.64 ps (7.19×) Gaussian arithmetic hit target (7×+ vs 1.5–2× expected). Batch::iteration reached 1.40× vs the 3× target. Post-mortem: the bench exercises 100 tiny 2-team events and the dominant cost is still Vec allocation in within_priors, sort_perm, and Game::likelihoods. The HashMap→Vec win shows at the History level (forward/backward sweep) which this bench doesn't exercise. Remediation plan documented in benches/baseline.txt: arena-ify sort_perm, within_priors, and Game::likelihoods in T1 when Game's internals are redesigned around the new factor graph. 38/38 tests passing. Closes T0 tier.	2026-04-24 07:28:28 +02:00
Anders Olsson	b1e0fcb817	perf(game): eliminate per-event allocations via ScratchArena Game::likelihoods previously allocated four Vecs (teams, diffs, ties, margins) on every call. Batch now owns one ScratchArena reused across all Game::new calls in the iteration loop; likelihoods() clears and extends the arena buffers instead of allocating fresh. For log_evidence (called infrequently), a local ScratchArena is created per invocation so the method signature stays &self. Also: add #[derive(Debug)] to TeamMessage and DiffMessage (required by ScratchArena's own Debug derive). Part of T0 engine redesign.	2026-04-24 07:24:29 +02:00
Anders Olsson	49d2b317da	refactor(history): replace HashMap<Index, Agent<D>> with dense AgentStore<D> AgentStore<D> is a Vec<Option<Agent<D>>>-backed store indexed directly by Index.0, eliminating per-iteration hashing in the cross-history forward/backward sweep. Implements Index<Index>/IndexMut<Index> for ergonomic agent access. AgentStore is public (so benches/batch.rs can use it). SkillStore remains pub(crate) since Skill is pub(crate) in batch.rs. HashMap<Index, _> is now only used for the posteriors() return value (temporary; will be replaced in T2 with a proper typed return) and for the add_events_with_prior(priors: HashMap<Index, Player<D>>) API (also T2 target). Part of T0 engine redesign.	2026-04-24 07:15:21 +02:00
Anders Olsson	8f60258dba	refactor(batch): replace HashMap<Index, Skill> with dense SkillStore SkillStore is a Vec<Skill>-backed dense store with a parallel present mask, indexed directly by Index.0. Eliminates per-iteration hashing in the within-slice convergence loop; O(1) array lookup replaces O(1) amortised hash lookup with better cache behaviour. Iteration order is now ascending-by-Index (was arbitrary for HashMap); EP fixed point is order-independent so posteriors are unchanged. Part of T0 engine redesign.	2026-04-24 07:08:20 +02:00
Anders Olsson	709ece335f	feat: introduce InferenceError; mu_sigma panic already eliminated mu_sigma was deleted as part of the Gaussian nat-param rewrite (its only callers were the old Mul/Div impls). This commit adds the InferenceError enum as a seed for the T2 API surface, with the NegativePrecision variant that mu_sigma would have returned. Part of T0 engine redesign.	2026-04-24 07:00:26 +02:00
Anders Olsson	a667deb7e1	refactor(gaussian): switch to natural-parameter storage (pi, tau) Mul and Div become two f64 adds/subs with no sqrt in the hot path. mu() and sigma() are computed on demand from stored pi/tau. Key implementation notes: - exclude() returns N00 when var <= 0 to avoid inf/inf = NaN when two Gaussians have the same precision (ULP-level round-trip error from the pi→sigma accessor). - Mul<f64> by 0.0 returns N00 (point mass at 0), matching old behavior. - from_ms(0, 0) == N00 {pi:inf, tau:0}; from_ms(0, inf) == N_INF {pi:0, tau:0}. Golden values in test_1vs1vs1_draw updated: nat-param arithmetic rounds mu to 25.0 (was 24.999999) and shifts sigma by ~3e-7. Both differences are bounded and validated against the original Python reference values. Part of T0 engine redesign.	2026-04-24 06:59:43 +02:00
Anders Olsson	06d3c886fe	bench: capture T0 baseline; expose pi/tau accessors; fix div panic - Promotes Gaussian::pi and Gaussian::tau to public so benches/gaussian.rs compiles, then captures baseline numbers for the T0 acceptance gate. - Fixes the divide bench: g1/g2 panicked (g1 has lower precision than g2; cavity requires pi_num >= pi_den). Swapped to g2/g1 (well-defined). Baseline on Apple M5 Pro: Batch::iteration 29.840 µs Gaussian::mul 1.568 ns (vs ~220 ps for add/sub — hot path) Gaussian::div 1.572 ns	2026-04-24 06:43:00 +02:00
Anders Olsson	d11d2e8c6b	docs: add T0 numerical-parity implementation plan Bite-sized, TDD-style task breakdown for the first tier of the engine redesign: Gaussian to natural-parameter storage, dense Vec storage replacing HashMap, ScratchArena to eliminate per-event allocs, Result-ifying the lone panic. No top-level public API change. Acceptance gate: ≥3x speedup on Batch::iteration vs. baseline. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-23 22:43:27 +02:00
Anders Olsson	c5f081d21f	docs: add TrueSkill-TT engine redesign spec Comprehensive design for a multi-tier rewrite covering performance, factor-graph extensibility, convergence scheduling, and API surface. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-23 22:33:48 +02:00
Anders Olsson	a14df02089	chore: do not publish v0.1.0	2026-04-23 20:26:52 +02:00
Anders Olsson	0d266b4428	chore: make cargo release add CHANGELOG.md before commit	2026-04-23 20:26:16 +02:00
Anders Olsson	a4b4e5e8fa	chore: clean up	2026-04-23 20:24:10 +02:00
Anders Olsson	04d5478ee4	style: cargo fmt	2026-04-23 20:23:13 +02:00
Anders Olsson	480467ac32	chore: added cliff.toml, release.toml and rustfmt.toml	2026-04-23 20:22:27 +02:00
Anders Olsson	dc47964310	added benchmark	2026-03-23 14:55:18 +01:00
Anders Olsson	61a5507f5c	remove notepad	2026-03-23 14:21:23 +01:00
Anders Olsson	a1f282a1c8	feat: added a Drift trait and a "default" ConstantDrift implementation	2026-03-16 12:06:04 +01:00
Anders Olsson	853f177fa8	Small changes for new 2024 edition	2025-02-21 14:09:58 +01:00
Anders Olsson	fc0efcdc52	Update edition	2025-02-21 14:06:28 +01:00
Anders Olsson	3bbddb168f	Ignore temp folder	2024-04-03 14:43:54 +02:00

1 2 3

112 Commits