
Add exploratory data analysis, more data preprocessing and features, and more models #9

Open
wants to merge 18 commits into main

Conversation

@schance995 schance995 commented Jun 5, 2024

Here's @0mWh's and my solution so far. We plan to attempt all 3 UnitaryHack challenges. There are several changes; we look forward to any questions and feedback.

Notebooks

  • QRNG_ Classification_Main_UnitaryHack windowed.ipynb for models with more training data from data/QRNG_ Classification_Main_UnitaryHack windowed_preprocessed_df_1717557318.csv.zst (generated in the same notebook)
  • QRNG_ Classification_Main_UnitaryHack.ipynb for more models, preprocessing, and exploratory data analysis.
  • process_logical_reduction.ipynb for distribution analysis and statistical testing

Changes so far

  • Use sliding window of 100 bits to generate more training data
    • Any subsequence of 100 bits is also generated by the same quantum computer
  • Compare results against classical PRNGs
    • Impossible to classify classical PRNGs unless noise is added
  • Exploratory data analysis
    • Check frequencies of bitstrings
      • Each label has a set of unique bitstrings, so it should be possible to tell them apart
    • Mann-Whitney U test to tell distributions apart
      • We can tell quantum computer 4 apart from the rest, but computers 1, 2, and 3 are quite similar
    • Use PCA, tSNE, UMAP to determine clustering of bitstrings and features
      • The bitstrings themselves are not informative
      • Need some computed features
      • Features become more informative with larger bitstrings
  • Add more features
  • Make features usable for ML
    • remove NaNs
    • remove features with identical values
    • mean-normalize and 0-1 normalize features
    • avoid test/train leakage
    • under/oversample
  • Train/test models
    • add threads to speed up computation
    • more sklearn models
      • Naive Bayes, K-means
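The sliding-window augmentation above can be sketched as follows. This is a minimal illustration, not the notebook's implementation; the `window` and `step` parameters are assumptions (the PR uses 100-bit windows, the step size is not stated):

```python
def sliding_windows(bits, window=100, step=1):
    """Yield every length-`window` subsequence of a bitstring.

    Any 100-bit subsequence of a sample was still produced by the
    same quantum computer, so each window becomes an extra training
    example with the same label.
    """
    return [bits[i:i + window] for i in range(0, len(bits) - window + 1, step)]

# A 104-bit sample with step 1 yields 5 overlapping 100-bit windows.
sample = "01" * 52  # 104 bits
windows = sliding_windows(sample)
```

With step 1, a sample of length n yields n - 99 windows, which is where the extra training data comes from.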
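The Mann-Whitney U test used to compare per-device distributions can be run with `scipy.stats.mannwhitneyu`. The feature values below are synthetic stand-ins (the real inputs would be computed features per bitstring, e.g. per-device feature columns from the preprocessed CSV):

```python
import numpy as np
from scipy.stats import mannwhitneyu

rng = np.random.default_rng(0)
# Hypothetical feature samples for two devices; a shifted mean stands in
# for the real difference observed between quantum computer 4 and the rest.
device_1 = rng.normal(50.0, 5.0, size=200)
device_4 = rng.normal(53.0, 5.0, size=200)

# Two-sided nonparametric test: do the two samples come from the
# same distribution? A small p-value lets us tell the devices apart.
stat, p = mannwhitneyu(device_1, device_4, alternative="two-sided")
```

When distributions largely overlap, as reported for devices 1-3, the p-value stays large and the test cannot separate them.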
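The leakage-avoidance point above boils down to fitting normalization statistics on the training split only and reusing them on the test split. A minimal sketch with hand-rolled 0-1 scaling (the notebooks may use sklearn transformers instead; the arrays here are illustrative):

```python
import numpy as np

def fit_minmax(X_train):
    """Compute 0-1 scaling parameters from the training split ONLY,
    so no test-set statistics leak into preprocessing."""
    lo, hi = X_train.min(axis=0), X_train.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # guard constant columns
    return lo, span

def apply_minmax(X, lo, span):
    return (X - lo) / span

X_train = np.array([[0.0, 10.0], [4.0, 30.0], [2.0, 20.0]])
X_test = np.array([[1.0, 40.0]])

lo, span = fit_minmax(X_train)
X_train_s = apply_minmax(X_train, lo, span)
X_test_s = apply_minmax(X_test, lo, span)  # may land outside [0, 1]
```

Scaled test values can fall outside [0, 1]; that is expected and preferable to refitting on the test split, which would leak information.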

Best performance so far

We got 67% accuracy on one of our models, but we caution against further interpretation until we implement more robust model testing. A limitation is that we don't have a held-out test set for a fair comparison against other project submissions.
We got 67% accuracy on one of our models, but we caution against further interpretation until we implement more robust model testing. A limitation is that we don't have a held-out test set for a fair comparison against other project submissions.

Next steps

  • Model tuning
    • balanced class weights
    • k-fold cross validation
    • hyperparameter sweep
  • Generalized quantum circuit
    • Qiskit Quantum Volume Circuit
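The first three tuning steps can be combined in a few lines of sklearn. This is a sketch of the planned setup, not the PR's code; the data, the choice of `LogisticRegression`, and the fold count are assumptions:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score

rng = np.random.default_rng(0)
# Hypothetical feature matrix and 4-class device labels.
X = rng.normal(size=(200, 8))
y = rng.integers(0, 4, size=200)

# Balanced class weights counteract label imbalance; stratified k-fold
# keeps per-device label proportions similar in every fold.
model = LogisticRegression(class_weight="balanced", max_iter=1000)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(model, X, y, cv=cv)
```

A hyperparameter sweep would wrap this in `GridSearchCV` with the same `cv` object, so every candidate is scored on identical folds.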

@0mWh commented Jun 14, 2024

currently sitting at 75-77% accuracy

@0mWh force-pushed the unitaryhack branch 2 times, most recently from e1e6749 to 8ef37b7 on June 20, 2024 at 22:37