bigbounce/spin-torsion cosmology research program
research live

Papers→ Paper 3

Paper 3HUBIFY-2026-00332 pages71 refsTarget: ApJS

Paper 3

Spectrally Unusual Sources at Scale: A Multi-Survey Catalog of 378,280 Path-C Unique Anomalies and a Native-Trained Novelty Fraction from 37.3 Million Sources and Map Patches

80% · active
v3.1.135
● live
0B/0M/1m/0C open

A 378,280-object anomaly catalog mined from 37.3M sources across 7 surveys with one autoencoder architecture — 17.8% of top-ranked objects are new to existing catalogs, plus a NANOGrav free-spectrum fit consistent with matter-bounce γ = 3.0.

Paper Artifacts
PDF · 32 pp · v3.1.130 · updated Jul 1, 2026 · md5 68a38fa2ApJS
Readiness● live80%

0B/0M/1m/0C open · July 3, 2026

The multi-survey anomaly catalog: 378,280 unique anomalies from 37.3 million sources across 7 surveys via one BigAE autoencoder architecture, with a 17.8% novelty rate at the top-1,000 stratum against 20 all-sky catalogs. A NANOGrav 15-yr free-spectrum re-fit gives γ = 2.567 ± 0.382 — matter-bounce γ=3.0 is consistent (+1.13σ) while SMBHB γ=4.33 is excluded (+4.61σ). Multi-tracer forecast: σ(fNL) = 8.27 ± 2.37.

Path to publication
  1. Draft completedone
  2. Internal multi-model reviewdone

    Native-PDF autoloop: Claude · GPT · Gemini · Grok · Perplexity

  3. Cross-vendor rounds cleandone

    3+ consecutive clean 5-vendor rounds (§4.4.1)

  4. External journal-style reviewin progress

    3 rigorous review rounds complete (Round A/B/C, INT+EXT, Jun 28-30 2026) on top of the de-biased external validation: 23 real items closed program-wide; final neutral truth-audit found 0 genuinely-new real findings. Verified internally honest; remaining external MAJORs are disclosed caveats + submission-time DOI/arXiv blockers + LLM-referee variance. Houston sign-off pending.

  5. Houston sign-offwaiting on Houston

    The final 1% — Houston only

  6. arXiv submissionqueued

    Needs endorsement · order P4 → P1A+P1B → P3 → P2 → P5

Focus areasNovelty: N3
  • 378,280 anomalies headline (=378,080 + 200) across 7 surveys
  • 7-way 5″ positional FoF dedup arithmetic (10,213 = 637 + 9,576)
  • Fisher-positivity caveats in §6 — canonical envelope σ(fNL) ∈ [3.92, 8.98] under 1/σ² = F_0 + c·α² (NOT the retracted symmetric ±2.37 form)
  • σ(fNL)=8.14 central at empirical α=0.19 jackknife (jk dispersion 0.65) at < from null
  • v3.1.71 cross-vendor R-round (Grok+GPT+Perplexity, 13 findings) — 0 VERIFIED, 13 STALE → clean-round closure deliverable
  • NANOGrav 15-yr γ = 2.567 ± 0.382 (real-KDE Zenodo emcee fit); matter-bounce γ=3.0 at +1.13σ; SMBHB γ=4.33 at +4.61σ; Savage-Dickey B_mb/SMBHB = 7,138 decisive
Notable contributions
  • Largest-scale autoencoder anomaly catalog to date: 378,280 unique anomalies across 7 archives (37.3M sources)
  • 141× scale of the largest prior single-survey spectroscopic anomaly catalog (Liang et al.\ 2023)
  • Path-C rebuild protocol addresses cross-transfer artifacts: 21.5× anomaly-rate reduction after LAMOST native retraining
  • Empirical multi-tracer α=0.19 ± 0.65 measured from QSO-candidate angular correlation; σ(fNL) \in [3.92, 8.98]
Review history (94 rounds)
2026-07-02RS15-targeted-2026-07-02RS15 targeted re-sweep — P4 morphology closure LIFTS residual-attribution flag (Grok+Gemini both MINOR, 0 MAJOR); P3 §IID/§III consistency fix CLEARS on both vendors
2026-07-01RS-FLOOR-SKILLS-2026-07-01Pattern-066 convergence adopted: '0 genuinely-new real findings' is the terminating gate
2026-07-01RS11-2026-07-01EXT RS11 — CONVERGENCE FLOOR: 0 genuinely-new real findings across all 6 papers
2026-07-01RS10-CLOSURE-2026-07-01RS10 closure: P4 T5 stat-bug removed, P1B sigma-distance scoped out, P3 REJECT was a misread
2026-07-01RS10-2026-07-01EXT RS10: 0/6 converge — fresh read surfaced 3 genuinely-new findings
2026-07-01RS9-2026-07-01EXT RS9: P4/P5/P1B all Grok+Gemini MINOR — closest yet

Full review timeline for Paper 3

External peer review kit

Houston-driven manual round · paste prompt into any frontier LLM web UI with the PDF attached

One click to copy a referee prompt scoped to this paper. One click to download the latest PDF. Paste both into Claude / GPT-5 / Gemini / Grok / Perplexity — return findings here and the autonomous cron will close them in the next bundled hard-fix wave.

Download latest PDFOpen in tab
preview prompt
You are an external referee for MNRAS / Physical Review D / JCAP (target journal depends on paper).

Attached: Paper 3 v3.1.135 — "Spectrally Unusual Sources at Scale: A Multi-Survey Catalog of 378,280 Path-C Unique Anomalies and a Native-Trained Novelty Fraction from 37.3 Million Sources and Map Patches"
Source: pipelines/p3_anomaly_engine/paper3_draft.tex
PDF: PDF · 32 pp · v3.1.130 · updated Jul 1, 2026 · md5 68a38fa2

Read the FULL PDF end-to-end. Produce a referee report in MNRAS format with:

1. Recommendation: ACCEPT / MINOR REVISIONS / MAJOR REVISIONS / REJECT
2. BLOCKERS (must fix before publication) — list each with section/line + proposed fix
3. MAJORS (should fix) — same format
4. MINORS (polish) — same format
5. Strengths (>= 3 bullet points)
6. Specific scrutiny on:
   - 378,280 anomalies headline (=378,080 + 200) across 7 surveys
   - 7-way 5″ positional FoF dedup arithmetic (10,213 = 637 + 9,576)
   - Fisher-positivity caveats in §6 — canonical 1σ envelope σ(f_NL) ∈ [3.92, 8.98] under 1/σ² = F_0 + c·α² (NOT the retracted symmetric ±2.37 form)
   - σ(f_NL)=8.14 central at empirical α=0.19 jackknife (jk dispersion 0.65) at <1σ from null
   - v3.1.71 cross-vendor R-round (Grok+GPT+Perplexity, 13 findings) — 0 VERIFIED, 13 STALE → clean-round closure deliverable
   - NANOGrav 15-yr γ = 2.567 ± 0.382 (real-KDE Zenodo emcee fit); matter-bounce γ=3.0 at +1.13σ; SMBHB γ=4.33 at +4.61σ; Savage-Dickey B_mb/SMBHB = 7,138 decisive

CALIBRATION (do not burn findings on these known classes):
- The current date is June 2026. arXiv identifiers of the form 25xx.xxxxx and 26xx.xxxxx are VALID, already-published preprints — do not flag them as "future-dated" or "nonexistent". Verify a citation against arXiv/ADS before claiming it does not exist.
- Correction notes, retraction notices, and "an earlier version stated X" disclosures in the text are DELIBERATE transparency policy. Flag them only if their content is wrong, never for existing.
- Companion-paper citations marked "posted concurrently on arXiv" are deliberate placeholders; real arXiv IDs are inserted during the coordinated submission sequence.
- Explicitly labeled conservatism allowances, scaling estimates, ansatz/heuristic status labels, and disclosed queued follow-up computations are deliberate scoping, not oversights — flag only if the label itself is inaccurate.
- PDF text extraction can mangle math (square roots, fractions, superscripts). Before flagging "garbled" or "wrong" math, consider extraction artifacts; flag only what is visibly wrong in the rendered PDF.

VERDICT STANDARD (apply the SAME high bar a first-pass Physical Review D / MNRAS referee would — this is one of the most rigorous journals in the world):
- Assign each finding's severity (BLOCKER / MAJOR / MINOR) by your own independent referee judgment. Do NOT default to any tier, and do NOT soften a finding because the rest of the paper is strong. Do not echo this prompt's context.
- A reporting choice that headlines the more favorable of two numbers, an unstated assumption, an uncontrolled systematic, or an internal inconsistency IS a real finding — classify it honestly (MINOR at minimum), not as mere "style" or "opinion".
- Truth-audit any claim that seems off by checking it against the published .tex / on-disk artifacts before flagging (this only filters out genuine extraction artifacts — it does not lower the bar on real defects).
Key Results
  • 01378,280 unique anomalies across 7 non-quarantined surveys from 37.3M sources (Wave 11 retitle)
  • 02Catalog-grade tier: 269,317 unique entries (point-source subset 269,117; DESI + SDSS native + eROSITA + Planck native + Gaia + NEOWISE); point-source tier 378,080; Planck CMB-patch tier 200
  • 03LAMOST 113,342 reclassified as exploratory tier (FAIL: ~56% B-dominant cross-transfer empirical; native retrain 21.4x reduction to 2,054 at S>5)
  • 04100k OOD validation (Wave 5 B10): median MSE 0.178, p99 = 44.85, 0.87% DESI anomaly rate preserved
  • 055-fold OOS Jaccard J̄ = 0.862 PASS on real DESI 47k-spectra retrain (Path-C exit criterion)
  • 06NANOGrav 15-yr KDE free-spectrum γ = 2.567 ± 0.382 — bounce γ=3.0 consistent (+1.13σ), SMBHB γ=4.33 excluded (+4.61σ)
  • 07Empirical α_jk = 0.19 ± 0.65 (consistent with zero at 0.29σ); σ(fNL) = 8.27 ± 2.37 multi-tracer forecast; SPHEREx 4.38σ for fNL = -35/8
  • 0858.8% novel objects (not in SIMBAD); injection/recovery 0% false positive at 10–1,377× enrichment
  • 09eROSITA enrichment statistic reframed as descriptive (selection-coupled), not an inferential significance (v3.1.80 wave)
  • 10Abstract novelty rate arithmetic-anchored 7.9% → 9.4%; gold/silver novelty tiers defined (116 + 1,006 = 1,122); FoF dedup audit artifact committed (v3.1.81 R23conf wave)
  • 11eROSITA 0.259 threshold-axis irreproducibility disclosed in-text; abstract framing upgraded with Wilson CI, prior qualifiers, and de-biased ordering (v3.1.82 R24conf wave)
  • 12eROSITA axis definitively resolved: monotone-rescaling class ruled out (Spearman ρ = −0.10); membership-is-canonical framing adopted (v3.1.83 wave)
  • 13Catalog-grade count corrected 264,938 → 269,317 (prior count double-removed 4,379 LAMOST-overlap objects; independent 6-way dedup verified); HEALPix 38,330px unrecoverable confirmed → reproducible rerun 24,049px χ²_ν=15.7; SMICA preprocessing documented, 200/200 top-200 reproduction (v3.1.84 pod wave)
  • 14R26conf: zero arithmetic errors across the round; 12 textual closures — cluster accounting made exact from the dedup artifact, NANOGrav Eq. E1 claim falsified by rederivation (v3.1.87)
  • 15TARGETTYPE-restricted recount completed: 2,468 DESI anomaly clusters (1.3%) sit on main-survey science-class spectra — ≈0.9× the Liang 2023 benchmark restricted, not 73×; ~98.7% of DESI anomalies fall on sky/secondary/filler spectra (v3.1.93 EXT3-B2 closure)
  • 16R32conf closure wave: recount-at-a-glance table added (3-vendor convergent ask), irreproducible S_BigAE column stripped from the eROSITA table, title moved to singular novelty fraction, SMBHB abstract framing tightened to 'not a cosmological detection' (v3.1.94)
  • 17R33conf confirmation CLEAN-after-audit: zero closure-introduced regressions across all 12 closures, 2nd consecutive zero-arithmetic round; abstract envelope clause + auditable numeric Fisher mapping landed; EXT4-eligible (v3.1.95)
  • 18FM1 eROSITA scaler-refit computed (was queued): scaler effect ≤ model-retrain reproducibility floor — top-298 overlap 257/298, Spearman 0.94; rates/rankings robust, ~15% quantified extreme-tail membership churn (v3.1.96)