DataSafari collapses complex data science workflows into clean, powerful one-liners.
DataSafari collapses complex data science workflows into clean, powerful one-liners.
Every function maps to a real step in your workflow — explore, transform, test, model.
Drop into any dataset blind. DataSafari's Explorer suite maps the landscape — patterns, outliers, distributions, correlations — before you write a second line of code.
Pass in your variables. DataSafari's Predictor suite checks your assumptions, selects the right test automatically, and returns results with plain-language conclusions — all in one command.
DataSafari's Inferential Predictor evaluates models across multiple metrics simultaneously, then surfaces the best fit. Full control over metric prioritization and selection when you need it.
Clean, encode, and engineer features in one call. DataSafari's Transformer suite handles numerical and categorical data so your inputs are ML-ready before you touch a model.
Specify your variables. DataSafari's ML Predictor suite handles preprocessing, evaluates multiple models, tunes hyperparameters, and hands you the top performers — ready for deployment.
Verify normality, equal variance, and other critical assumptions before you commit to a test. DataSafari's Evaluators run the checks and explain the results in plain language.
Every function maps to a real step in your workflow — explore, transform, test, model.
Drop into any dataset blind. DataSafari's Explorer suite maps the landscape — patterns, outliers, distributions, correlations — before you write a second line of code.
Clean, encode, and engineer features in one call. DataSafari's Transformer suite handles numerical and categorical data so your inputs are ML-ready before you touch a model.
Pass in your variables. DataSafari's Predictor suite checks your assumptions, selects the right test automatically, and returns results with plain-language conclusions — all in one command.
Specify your variables. DataSafari's ML Predictor suite handles preprocessing, evaluates multiple models, tunes hyperparameters, and hands you the top performers — ready for deployment.
DataSafari's Inferential Predictor evaluates models across multiple metrics simultaneously, then surfaces the best fit. Full control over metric prioritization and selection when you need it.
Verify normality, equal variance, and other critical assumptions before you commit to a test. DataSafari's Evaluators run the checks and explain the results in plain language.
DataSafari is available on PyPI and can be installed using pip or Poetry.
For full API reference, quickstart guides, and real-world examples, read the docs.
For full API reference, quickstart guides, and real-world examples, read the docs.
from datasafari.predictor import predict_hypothesis import pandas as pd import numpy as np # Create a sample DataFrame df_hypothesis = pd.DataFrame({ 'Group': np.random.choice(['Control', 'Treatment'], size=100), 'Score': np.random.normal(0, 1, 100) }) # Perform hypothesis testing results = predict_hypothesis(df_hypothesis, 'Group', 'Score')