Complex Workflows.

Simple Solutions. 

Complex Workflows.

Simple Solutions. 

DataSafari collapses complex data science workflows into clean, powerful one-liners.

DataSafari collapses complex data science workflows into clean, powerful one-liners.

|
Scroll Down Arrow

DataSafari at a Glance

Every function maps to a real step in your workflow — explore, transform, test, model.

Explore

Instant & Robust Exploratory data analysis.

Drop into any dataset blind. DataSafari's Explorer suite maps the landscape — patterns, outliers, distributions, correlations — before you write a second line of code.

Test Hypotheses

Data evaluation, Test selection & Result Interpretation. Automatically.

Pass in your variables. DataSafari's Predictor suite checks your assumptions, selects the right test automatically, and returns results with plain-language conclusions — all in one command.

Make Inferences

Automatic model selection & interpretation.

DataSafari's Inferential Predictor evaluates models across multiple metrics simultaneously, then surfaces the best fit. Full control over metric prioritization and selection when you need it.

Transform

Swiss knife of Data Transformation.

Clean, encode, and engineer features in one call. DataSafari's Transformer suite handles numerical and categorical data so your inputs are ML-ready before you touch a model.

Build ML Models

Preprocessing, Tuning & Model evaluation. Seamlessly.

Specify your variables. DataSafari's ML Predictor suite handles preprocessing, evaluates multiple models, tunes hyperparameters, and hands you the top performers — ready for deployment.

Evaluate

Model Assumptions, Data Types & More.

Verify normality, equal variance, and other critical assumptions before you commit to a test. DataSafari's Evaluators run the checks and explain the results in plain language.

DataSafari at a Glance

Every function maps to a real step in your workflow — explore, transform, test, model.

Explore

Instant & Robust Exploratory data analysis.

Drop into any dataset blind. DataSafari's Explorer suite maps the landscape — patterns, outliers, distributions, correlations — before you write a second line of code.

Transform

Swiss knife of Data Transformation.

Clean, encode, and engineer features in one call. DataSafari's Transformer suite handles numerical and categorical data so your inputs are ML-ready before you touch a model.

Test Hypotheses

Data evaluation, Test selection & Result Interpretation. Automatically.

Pass in your variables. DataSafari's Predictor suite checks your assumptions, selects the right test automatically, and returns results with plain-language conclusions — all in one command.

Build ML Models

Preprocessing, Tuning & Model evaluation. Seamlessly.

Specify your variables. DataSafari's ML Predictor suite handles preprocessing, evaluates multiple models, tunes hyperparameters, and hands you the top performers — ready for deployment.

Make Inferences

Automatic model selection & interpretation.

DataSafari's Inferential Predictor evaluates models across multiple metrics simultaneously, then surfaces the best fit. Full control over metric prioritization and selection when you need it.

Evaluate

Model Assumptions, Data Types & More.

Verify normality, equal variance, and other critical assumptions before you commit to a test. DataSafari's Evaluators run the checks and explain the results in plain language.

Get Started

installation

DataSafari is available on PyPI and can be installed using pip or Poetry.

1. Install DataSafari in your Terminal

2. Import DataSafari in your Python script

Usage

For full API reference, quickstart guides, and real-world examples, read the docs.

For full API reference, quickstart guides, and real-world examples, read the docs.

Hypothesis Testing with DataSafari

Code Block with Syntax Highlighting
python
from datasafari.predictor import predict_hypothesis
import pandas as pd
import numpy as np

# Create a sample DataFrame
df_hypothesis = pd.DataFrame({
    'Group': np.random.choice(['Control', 'Treatment'], size=100),
    'Score': np.random.normal(0, 1, 100)
})

# Perform hypothesis testing
results = predict_hypothesis(df_hypothesis, 'Group', 'Score')
        

Open Source

DataSafari is built on a foundation of transparency. Distributed under GNU GPL v3.

Access Source Code Button