Generated by just snapshot from artifacts/full_with_peer at 2026-05-11T05:44:55+00:00.
Discussion
Research question. Can filing-origin public SEC/PCAOB information predict whether an issuer later enters observable public review-and-correction channels, and how does this public reporting-risk construct relate to, but differ from, legacy detected-misstatement benchmarks?
Data. The workflow combines the legacy gvkey x data_year detected-misstatement benchmark, the public SEC/PCAOB lake, the gold issuer_origin_panel and filing_origin_panel, and an external gvkey-CIK-year bridge for overlap validation.
Models. The core public cascade uses XGBoost over metadata, XBRL, text/notes, auditor, oversight, and all-feature sets. Peer-compatible Dechow, Perols, Bao, and Bertomeu-style suites are included when the peer-enabled study directory is present.
Metrics. The common metric vocabulary is PR-AUC relative to prevalence, ROC-AUC, Brier, Brier Skill Score, ECE, top-k precision, top-decile lift, and Bao-style top-fraction precision, sensitivity, specificity, BAC, and NDCG.
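The calibration metrics in this vocabulary can be illustrated with a minimal sketch. It assumes the standard definitions: Brier score as the mean squared probability error, Brier Skill Score measured against a constant base-rate forecast, and ECE over equal-width probability bins. The snapshot does not state which reference forecast or binning the pipeline uses, and the function names here are hypothetical.

```python
from typing import Sequence


def brier_score(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def brier_skill_score(y_true: Sequence[int], y_prob: Sequence[float]) -> float:
    """1 - Brier / Brier_ref, where the reference always predicts the base rate.

    Assumes both classes are present; otherwise the reference Brier is zero.
    """
    prevalence = sum(y_true) / len(y_true)
    reference = brier_score(y_true, [prevalence] * len(y_true))
    return 1.0 - brier_score(y_true, y_prob) / reference


def expected_calibration_error(
    y_true: Sequence[int], y_prob: Sequence[float], n_bins: int = 10
) -> float:
    """Equal-width ECE: size-weighted mean |observed rate - mean predicted prob|."""
    bins = [[] for _ in range(n_bins)]
    for y, p in zip(y_true, y_prob):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 falls in the last bin
        bins[idx].append((y, p))
    n = len(y_true)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        observed = sum(y for y, _ in members) / len(members)
        predicted = sum(p for _, p in members) / len(members)
        ece += len(members) / n * abs(observed - predicted)
    return ece


# Toy labels with 25% prevalence (illustrative only, not study data).
y_true = [0, 0, 0, 1]
base_rate_forecast = [0.25] * 4
perfect_forecast = [0.0, 0.0, 0.0, 1.0]

print(brier_skill_score(y_true, base_rate_forecast))  # no skill over the base rate
print(brier_skill_score(y_true, perfect_forecast))    # maximal skill
```

Under these definitions a constant base-rate forecast has zero skill but is perfectly calibrated in the ECE sense, which is why Brier Skill Score and ECE are worth reading alongside the ranking metrics rather than in place of them.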
Sellable claim. The strongest current framing is a measurement-and-ranking paper on filing-origin public reporting-risk states. It does not support causal claims, unobserved true-fraud occurrence claims, or same-estimand performance rankings over prior fraud-prediction papers.
Current best public-cascade specification. all + rolling_7y, with a reported mean PR-AUC of 0.2430.
Bridge boundary. Construct overlap is candidate_farr; WRDS or equivalent institutional bridge evidence remains preferred for final manuscript-grade integrated claims.
flowchart LR
L["Legacy benchmark<br/>timing, drift, missingness,<br/>peer-compatible metrics"]
P["Public filing-origin cascade<br/>comment threads, amendments,<br/>8-K Item 4.02, AAER support"]
B["Bridge gate<br/>gvkey-CIK-year coverage<br/>candidate_farr unless WRDS supplied"]
V["Construct-overlap checks<br/>co-occurrence, lift,<br/>reciprocal ranking, event time"]
S["Snapshot docs<br/>generated from artifacts<br/>checked by just snapshot"]
L --> B
P --> B
B --> V
V --> S
These rows are present only when the peer-enabled study has run. They reflect model-family transfer and metric-language alignment, not exact replications of the original papers' samples.
| Model | Rows | Mean PR-AUC | Mean ROC-AUC | Max PR-AUC | Mean Brier |
|---|---|---|---|---|---|
| bertomeu_style_xgb | 336 | 0.0427 | 0.6601 | 0.1710 | 0.0162 |
| perols_logit | 336 | 0.0315 | 0.6156 | 0.0759 | 0.1775 |
| perols_bagged | 336 | 0.0311 | 0.6271 | 0.0868 | 0.1809 |
| perols_linear_svm | 336 | 0.0306 | 0.6131 | 0.0745 | 0.1862 |
| perols_stacking | 336 | 0.0302 | 0.6075 | 0.0708 | 0.1967 |
| perols_mlp | 336 | 0.0297 | 0.5888 | 0.0716 | 0.2022 |
| bao_inspired_tree_ensemble | 336 | 0.0283 | 0.6251 | 0.0628 | 0.0165 |
| dechow_variable_logit | 336 | 0.0235 | 0.5225 | 0.0672 | 0.2466 |
| perols_entropy_tree | 336 | 0.0227 | 0.5810 | 0.0444 | 0.2245 |
Public-Label Peer Transfer
| Model | Rows | Mean PR-AUC | Mean ROC-AUC | Max PR-AUC | Mean Brier |
|---|---|---|---|---|---|
| bertomeu_style_xgb | 480 | 0.2247 | 0.6453 | 0.5082 | 0.1124 |
| bao_inspired_tree_ensemble | 480 | 0.2245 | 0.6449 | 0.5063 | 0.1124 |
| perols_bagged | 480 | 0.2119 | 0.6319 | 0.4561 | 0.2238 |
| perols_stacking | 480 | 0.2074 | 0.6184 | 0.4454 | 0.2224 |
| perols_linear_svm | 480 | 0.2056 | 0.6166 | 0.6283 | 0.2242 |
| perols_logit | 480 | 0.2056 | 0.6217 | 0.5472 | 0.2252 |
| perols_mlp | 480 | 0.2002 | 0.6070 | 0.4413 | 0.2324 |
| perols_entropy_tree | 480 | 0.1997 | 0.6125 | 0.4264 | 0.2287 |
| dechow_variable_logit | 480 | 0.1584 | 0.5348 | 0.3084 | 0.2500 |
Public Peer Task Summary
| Task | Rows | Mean prevalence | Mean PR-AUC | Mean ROC-AUC | Max PR-AUC |
|---|---|---|---|---|---|
| comment_thread | 1,440 | 0.2615 | 0.3284 | 0.5915 | 0.5082 |
| amendment | 1,440 | 0.1552 | 0.2326 | 0.6097 | 0.3854 |
| 8k_402 | 1,440 | 0.0221 | 0.0516 | 0.6432 | 0.6283 |
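Because a random ranker's expected PR-AUC is approximately the positive-class prevalence, the task rows are easier to compare as a ratio of PR-AUC to prevalence. The sketch below applies that ratio to the mean values reported in the task summary. The helper name `pr_auc_lift` is hypothetical, and a ratio of means is only a rough stand-in for the per-row ratios, which this snapshot does not report.

```python
# (mean prevalence, mean PR-AUC) copied from the Public Peer Task Summary.
task_summary = {
    "comment_thread": (0.2615, 0.3284),
    "amendment": (0.1552, 0.2326),
    "8k_402": (0.0221, 0.0516),
}


def pr_auc_lift(pr_auc: float, prevalence: float) -> float:
    """PR-AUC expressed as a multiple of the no-skill baseline (prevalence)."""
    return pr_auc / prevalence


lifts = {
    task: pr_auc_lift(pr_auc, prevalence)
    for task, (prevalence, pr_auc) in task_summary.items()
}

for task, lift in lifts.items():
    print(f"{task}: {lift:.2f}x prevalence")
# comment_thread: 1.26x prevalence
# amendment: 1.50x prevalence
# 8k_402: 2.33x prevalence
```

Read this way, the rarest label (8k_402) has the lowest absolute mean PR-AUC but the largest improvement over its base rate, consistent with its higher mean ROC-AUC.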
Bridge and Construct-Overlap Validation
| Metric | Value |
|---|---|
| raw_rows | 82,908 |
| raw_firms | 9,156 |
| matched_raw_rows | 81,218 |
| matched_raw_firms | 9,075 |
| row_coverage_rate | 0.9796 |
| firm_coverage_rate | 0.9912 |
| raw_positive_rows | 2,460 |
| matched_positive_rows | 2,433 |
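The two coverage rates follow directly from the reported counts, as the short check below shows. The positive-row coverage on the last lines is a derived figure that this snapshot does not report; it is included only to show how many legacy positives survive the bridge.

```python
# Counts copied from the bridge validation metrics above.
bridge = {
    "raw_rows": 82_908,
    "raw_firms": 9_156,
    "matched_raw_rows": 81_218,
    "matched_raw_firms": 9_075,
    "raw_positive_rows": 2_460,
    "matched_positive_rows": 2_433,
}

row_coverage_rate = bridge["matched_raw_rows"] / bridge["raw_rows"]
firm_coverage_rate = bridge["matched_raw_firms"] / bridge["raw_firms"]

# Derived here, not reported in the snapshot: share of positive rows matched.
positive_row_coverage = bridge["matched_positive_rows"] / bridge["raw_positive_rows"]

print(f"row_coverage_rate     = {row_coverage_rate:.4f}")      # 0.9796
print(f"firm_coverage_rate    = {firm_coverage_rate:.4f}")     # 0.9912
print(f"positive_row_coverage = {positive_row_coverage:.4f}")  # 0.9890
```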
| Direction | Model | Target | PR-AUC | ROC-AUC | Top-decile lift |
|---|---|---|---|---|---|
| Public cascade score -> legacy positives | public_cascade | 8k_402 | 0.0326 | 0.6828 | 2.9462 |
| Legacy/peer score -> public labels | bertomeu_style_xgb | label_8k_402_365 | 0.0436 | 0.7033 | 3.0477 |
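For readers unfamiliar with the last column, the following is a minimal sketch of top-decile lift under its usual definition: the positive rate among the highest-scored 10% of rows divided by the overall positive rate. Whether the pipeline computes the column exactly this way, including how it breaks ties, is an assumption; the function name and toy data are hypothetical.

```python
from typing import Sequence


def top_decile_lift(
    y_true: Sequence[int], scores: Sequence[float], fraction: float = 0.10
) -> float:
    """Positive rate in the top `fraction` of scores divided by the base rate."""
    n_top = max(1, int(len(scores) * fraction))
    # Rank rows from highest to lowest score; ties keep their input order.
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)
    top_rate = sum(label for _, label in ranked[:n_top]) / n_top
    base_rate = sum(y_true) / len(y_true)
    return top_rate / base_rate


# Toy data (not study data): 20 rows, 2 positives, scores descending by position.
toy_scores = [1.0 - i / 20 for i in range(20)]
toy_labels = [1] + [0] * 9 + [1] + [0] * 9  # one positive lands in the top decile

toy_lift = top_decile_lift(toy_labels, toy_scores)
print(toy_lift)  # top-2 rate 0.5 over base rate 0.1
```

Under this definition, lift equals the share of all positives captured in the top decile divided by 0.10, so a lift near 3 would mean roughly 30% of the target positives sit in the top-scored tenth of the matched rows.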
Key readings:
Public labels and legacy detected-misstatement labels are related but non-identical constructs.
Public-cascade scores can rank legacy positives in the matched overlap; legacy/peer scores can also rank severe public correction labels.
candidate_farr bridge evidence is useful for internal validation, but should be labeled clearly until a WRDS-grade bridge is available.
Selected Artifact Index
This index lists high-signal artifacts referenced by this generated snapshot.