๐ŸŽ“ Student project from MITS Gwalior ยท Open source & completely free View on GitHub โ†’
Live on HuggingFace Spaces ยท v2.0

Detect fraud.
Understand why.

Upload any binary classification CSV and get fraud predictions, SHAP explainability, and cost-optimized thresholds โ€” no code required.

๐Ÿš€ Launch Live Demo โญ Star on GitHub
0.975ROC-AUC
285Tests Passing
99%Code Coverage
2M+Rows Supported

Capabilities

Everything built in.
Nothing to configure.

Built for analysts, researchers, and students who want production ML power without writing a single line of code.

๐Ÿค–
AutoML Engine

Automatically selects and tunes the best model for your dataset. Supports 100 rows all the way up to 2 million rows across 6 smart size tiers.

๐Ÿ”
SHAP Explainability

Every prediction comes with a full SHAP breakdown โ€” see exactly which features drove the fraud decision and by how much.

๐Ÿ’ฐ
Cost-Optimized Threshold

Balances false positives vs false negatives based on your real business cost model โ€” not just raw accuracy metrics.

๐Ÿ“ฆ
Batch Processing

Upload a CSV with thousands of transactions. Get risk scores, confidence intervals, and downloadable results instantly.

๐ŸŒŠ
Drift Detection

Monitors your model for data drift and performance degradation over time โ€” alerts you before accuracy degrades in production.

๐Ÿ“Š
MLflow Registry

Full model versioning with MLflow. Champion/challenger promotion, experiment tracking, and fully reproducible training runs.


Workflow

Three steps.
Production results.

No ML knowledge required. Just bring your data.

01
Upload Your CSV

Drop any binary classification dataset. AutoML-X detects columns, handles missing values, and engineers features automatically โ€” no preprocessing needed.

02
Model Scores Risk

The engine trains, evaluates, and selects the best model. You get fraud probability scores with confidence intervals for every single row.

03
Review & Decide

Explore SHAP explanations per prediction, tune your business decision threshold, and download results โ€” all in a clean, interactive dashboard.


Model Performance

Numbers that speak
for themselves.

Validated on real transaction data with SMOTE-balanced training and cost-optimized threshold selection.

ROC-AUC Score
0.975
Near-perfect separation between fraud and legitimate transactions on the test set.
Fraud Recall
89.8%
Catches almost 9 in 10 fraudulent transactions before they cause financial damage.
Test Coverage
99%
285 passing tests with a full CI/CD pipeline running on every commit via GitHub Actions.
automl-x ยท evaluation output
# Running AutoML-X evaluation pipeline
$ python evaluate.py --dataset fraud_sample.csv
Loading dataset... 50,000 rows ยท 28 features
Applying SMOTE balancing... done
Training candidates: RandomForest ยท XGBoost ยท LightGBM
Best model selected: RandomForestClassifier
ROC-AUC : 0.9752
Recall : 0.898 # cost-threshold applied
Threshold: 0.63 # business-cost optimized
$ pytest --cov=src tests/ -q
285 passed, 0 failed ยท coverage: 99% ยท 12.4s

Technology

Production-grade stack.

Built with battle-tested open source tools. Dockerized and deployed on HuggingFace Spaces with full CI/CD automation.

๐Ÿ Python 3.11 โšก Streamlit ๐Ÿ”ฌ scikit-learn ๐Ÿ’ก SHAP ๐Ÿ“ˆ MLflow ๐Ÿณ Docker ๐Ÿค— HuggingFace Spaces ๐Ÿงช pytest ยท 99% cov ๐Ÿ”„ GitHub Actions CI ๐Ÿ“Š Plotly โš–๏ธ SMOTE ๐Ÿš€ FastAPI ๐Ÿ—„๏ธ SQLite ยท MLflow Registry ๐Ÿ“ก supervisord

Ready to detect fraud
in your data?

Free, open-source, and running live right now. No signup required.

๐Ÿš€ Launch AutoML-X View Source Code