Live on HuggingFace Spaces · v2.0

Detect fraud.
Understand why.

Upload any binary classification CSV and get fraud predictions, SHAP explainability, and cost-optimized thresholds — no code required.

🚀 Launch Live Demo ⭐ Star on GitHub

0.975 ROC-AUC

285 Tests Passing

99% Code Coverage

2M+ Rows Supported

Capabilities

Everything built in.
Nothing to configure.

Built for analysts, researchers, and students who want production ML power without writing a single line of code.

🤖

AutoML Engine

Automatically selects and tunes the best model for your dataset. Handles anything from 100 rows up to 2 million rows across 6 smart size tiers.

🔍

SHAP Explainability

Every prediction comes with a full SHAP breakdown — see exactly which features drove the fraud decision and by how much.
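To give a feel for what a SHAP breakdown measures, here is a minimal sketch that computes exact Shapley values for a toy scoring function by averaging each feature's marginal contribution over every feature ordering. The scoring function, feature names, and baseline are hypothetical and this is not AutoML-X's code; the SHAP library relies on faster, model-specific algorithms for the same kind of attribution.

```python
from itertools import permutations


def toy_score(x):
    """Hypothetical risk score standing in for a trained model."""
    return 0.5 * x["amount"] + 0.25 * x["hour"] + 0.25 * x["amount"] * x["foreign"]


def shapley_values(score, instance, baseline):
    """Exact Shapley values: average marginal contribution over all orderings."""
    features = list(instance)
    totals = {f: 0.0 for f in features}
    orderings = list(permutations(features))
    for order in orderings:
        current = dict(baseline)      # start from the baseline input
        prev = score(current)
        for f in order:
            current[f] = instance[f]  # switch one feature to its real value
            new = score(current)
            totals[f] += new - prev   # credit the change to that feature
            prev = new
    return {f: totals[f] / len(orderings) for f in features}


# Assumed example values, chosen only for illustration.
instance = {"amount": 4.0, "hour": 2.0, "foreign": 1.0}
baseline = {"amount": 0.0, "hour": 0.0, "foreign": 0.0}
phi = shapley_values(toy_score, instance, baseline)
# phi is {"amount": 2.5, "hour": 0.5, "foreign": 0.5}
```

The contributions add up to the gap between the instance's score and the baseline score, which is what lets a breakdown say "by how much" each feature moved the decision. Exact enumeration grows factorially with the number of features, so it is only practical for tiny examples like this one.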

💰

Cost-Optimized Threshold

Balances false positives vs false negatives based on your real business cost model — not just raw accuracy metrics.
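As a rough illustration of cost-based threshold selection, the sketch below sweeps candidate thresholds and keeps the one with the lowest total cost. The labels, scores, and per-error costs are made up, and the function names are hypothetical; the page does not describe how AutoML-X performs this search.

```python
def expected_cost(y_true, scores, threshold, cost_fp, cost_fn):
    """Total cost of the errors made when flagging every score >= threshold."""
    fp = sum(1 for y, s in zip(y_true, scores) if s >= threshold and y == 0)
    fn = sum(1 for y, s in zip(y_true, scores) if s < threshold and y == 1)
    return fp * cost_fp + fn * cost_fn


def best_threshold(y_true, scores, cost_fp, cost_fn):
    """Sweep thresholds 0.01..0.99 and return the first one with minimal cost."""
    candidates = [i / 100 for i in range(1, 100)]
    return min(
        candidates,
        key=lambda t: expected_cost(y_true, scores, t, cost_fp, cost_fn),
    )


# Made-up validation labels (1 = fraud) and model scores.
y_true = [0, 0, 0, 0, 1, 0, 1, 1]
scores = [0.05, 0.10, 0.20, 0.40, 0.45, 0.60, 0.70, 0.90]

# Missed fraud is expensive: the best threshold moves down.
t_fn_heavy = best_threshold(y_true, scores, cost_fp=1, cost_fn=50)
# False alarms are expensive: the best threshold moves up.
t_fp_heavy = best_threshold(y_true, scores, cost_fp=50, cost_fn=1)
```

With these toy numbers the same model ends up with two different operating points, depending only on which kind of error costs more.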

📦

Batch Processing

Upload a CSV with thousands of transactions. Get risk scores, confidence intervals, and downloadable results instantly.

🌊

Drift Detection

Monitors your model for data drift and performance degradation over time, and alerts you before accuracy drops in production.
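The page does not say which drift statistic is used. One common choice is the Population Stability Index (PSI), sketched below as an assumption: it bins a reference sample of a feature and compares the bin proportions with those of a newer sample. The function name, bin count, and sample data are illustrative only.

```python
import math


def psi(reference, current, bins=10, eps=1e-6):
    """Population Stability Index between two samples of one numeric feature."""
    lo, hi = min(reference), max(reference)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 if the reference is constant

    def proportions(values):
        counts = [0] * bins
        for v in values:
            idx = int((v - lo) / width)
            idx = min(max(idx, 0), bins - 1)  # clamp values outside the reference range
            counts[idx] += 1
        # eps keeps empty bins from producing log(0) or division by zero
        return [max(c / len(values), eps) for c in counts]

    ref_p = proportions(reference)
    cur_p = proportions(current)
    return sum((c - r) * math.log(c / r) for r, c in zip(ref_p, cur_p))


# Illustrative data: a reference sample and a copy shifted upward by 0.5.
reference = [i / 100 for i in range(100)]
shifted = [v + 0.5 for v in reference]

no_drift = psi(reference, reference)   # identical samples give 0.0
strong_drift = psi(reference, shifted)
```

A commonly cited rule of thumb treats a PSI above roughly 0.25 as a significant shift, though the right alert level depends on the feature and the bin count.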

📊

MLflow Registry

Full model versioning with MLflow. Champion/challenger promotion, experiment tracking, and fully reproducible training runs.

Workflow

Three steps.
Production results.

No ML knowledge required. Just bring your data.

Upload Your CSV

Drop any binary classification dataset. AutoML-X detects columns, handles missing values, and engineers features automatically — no preprocessing needed.

Model Scores Risk

The engine trains, evaluates, and selects the best model. You get fraud probability scores with confidence intervals for every single row.

Review & Decide

Explore SHAP explanations per prediction, tune your business decision threshold, and download results — all in a clean, interactive dashboard.

Model Performance

Numbers that speak
for themselves.

Validated on real transaction data with SMOTE-balanced training and cost-optimized threshold selection.
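For readers new to SMOTE, the core idea is to create synthetic minority-class rows by interpolating between a real minority row and one of its nearest minority neighbours. The sketch below shows that step in plain Python with made-up points and a hypothetical function name; it is not the AutoML-X pipeline, and production code would normally use a maintained library implementation.

```python
import random


def smote_sample(minority, k=2, n_new=4, seed=0):
    """Create n_new synthetic points by interpolating toward near neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        # k nearest minority neighbours of x by squared Euclidean distance
        neighbours = sorted(
            (p for p in minority if p is not x),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(x, p)),
        )[:k]
        nb = rng.choice(neighbours)
        u = rng.random()  # position along the segment from x to nb
        synthetic.append(tuple(a + u * (b - a) for a, b in zip(x, nb)))
    return synthetic


# Made-up minority-class (fraud) rows with two numeric features.
fraud_rows = [(1.0, 2.0), (1.5, 2.5), (2.0, 2.0), (1.2, 3.0)]
new_rows = smote_sample(fraud_rows)
```

Every synthetic row lies on a line segment between two real fraud rows, so the minority class grows without simply duplicating existing records. Resampling of this kind is applied to training data only, so that evaluation reflects the real class balance.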

ROC-AUC Score

0.975

Strong separation between fraudulent and legitimate transactions on the test set.
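ROC-AUC can be read as the probability that a randomly chosen fraudulent transaction receives a higher score than a randomly chosen legitimate one. The sketch below computes it directly from that definition on made-up labels and scores; it is a plain-Python illustration, not the evaluation code behind the figure above.

```python
def roc_auc(y_true, scores):
    """AUC as the share of (positive, negative) pairs ranked correctly."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    # A correctly ordered pair counts 1, a tie counts 0.5, a wrong order counts 0.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up labels (1 = fraud) and model scores.
y_true = [0, 0, 1, 0, 1, 1]
scores = [0.10, 0.40, 0.35, 0.80, 0.70, 0.90]
auc = roc_auc(y_true, scores)  # 6 of 9 pairs are ordered correctly
```

This pairwise form is quadratic in the number of rows, so it suits small examples; library implementations get the same value from sorted ranks.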

Fraud Recall

89.8%

Catches almost 9 in 10 fraudulent transactions before they cause financial damage.

Test Coverage

99%

285 passing tests with a full CI/CD pipeline running on every commit via GitHub Actions.

automl-x · evaluation output

# Running AutoML-X evaluation pipeline

$ python evaluate.py --dataset fraud_sample.csv

Loading dataset... 50,000 rows · 28 features

Applying SMOTE balancing... done

Training candidates: RandomForest · XGBoost · LightGBM

Best model selected: RandomForestClassifier

ROC-AUC : 0.9752

Recall : 0.898 # cost-threshold applied

Threshold: 0.63 # business-cost optimized

$ pytest --cov=src tests/ -q

285 passed, 0 failed · coverage: 99% · 12.4s

Technology

Production-grade stack.

Built with battle-tested open source tools. Dockerized and deployed on HuggingFace Spaces with full CI/CD automation.

🐍 Python 3.11 ⚡ Streamlit 🔬 scikit-learn 💡 SHAP 📈 MLflow 🐳 Docker 🤗 HuggingFace Spaces 🧪 pytest · 99% cov 🔄 GitHub Actions CI 📊 Plotly ⚖️ SMOTE 🚀 FastAPI 🗄️ SQLite · MLflow Registry 📡 supervisord

Ready to detect fraud
in your data?