Complex Workflows.

Simple Solutions. 

DataSafari simplifies complex data science tasks into straightforward, powerful one-liners. 

DataSafari at a Glance

DataSafari's function suites mirror the stages of a typical data workflow.

Explore

Instant & Robust Exploratory Data Analysis.

Plunge into new datasets with a suite of tools tailored for in-depth exploratory analysis. DataSafari's Explorer suite helps you uncover patterns, pinpoint issues, and access detailed statistics, so you gain a thorough understanding of your data's landscape.
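
As a rough illustration of the kind of first-pass summary described above, the sketch below uses plain pandas rather than DataSafari's Explorer suite, whose functions are not shown on this page. The `quick_overview` helper and the sample columns are hypothetical.

```python
import numpy as np
import pandas as pd


def quick_overview(df: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column: dtype, missing count, unique count."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "missing": df.isna().sum(),
        "unique": df.nunique(),  # NaN values are not counted as unique
    })


# Hypothetical sample data with one missing value
df = pd.DataFrame({
    "Age": [23, 35, np.nan, 41],
    "City": ["Oslo", "Lima", "Oslo", "Kyiv"],
})

overview = quick_overview(df)
print(overview)
print(df.describe())  # summary statistics for the numeric columns
```

A table like this is usually enough to spot columns that need cleaning before any deeper analysis.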

Test Hypotheses

Data Evaluation, Test Selection & Result Interpretation. Automatically.

Enter your variables of interest and DataSafari's Predictor suite takes over, automatically evaluating your data to select an appropriate hypothesis test. A single command carries you from data assessment to a scientific interpretation of the results.
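
To make the idea of automatic test selection concrete, here is a minimal sketch for a two-group comparison of a numeric outcome. The decision rule and the `choose_two_group_test` helper are illustrative assumptions built on SciPy, not DataSafari's internal logic.

```python
import numpy as np
from scipy import stats


def choose_two_group_test(a, b, alpha=0.05):
    """Pick a two-sample test from simple assumption checks (illustrative)."""
    # Shapiro-Wilk: a small p-value suggests the sample is not normal
    normal = stats.shapiro(a).pvalue > alpha and stats.shapiro(b).pvalue > alpha
    if not normal:
        result = stats.mannwhitneyu(a, b, alternative="two-sided")
        return "Mann-Whitney U", result.pvalue
    # Levene: a small p-value suggests unequal variances
    equal_var = bool(stats.levene(a, b).pvalue > alpha)
    name = "Student's t-test" if equal_var else "Welch's t-test"
    return name, stats.ttest_ind(a, b, equal_var=equal_var).pvalue


# Hypothetical sample data
rng = np.random.default_rng(0)
control = rng.normal(0.0, 1.0, 50)
treatment = rng.normal(0.5, 1.0, 50)

test_name, p_value = choose_two_group_test(control, treatment)
print(test_name, p_value)
```

The point of the sketch is the ordering: assumption checks come first, and their outcome determines which test is run.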

Make Inferences

Automatic model selection & interpretation.

Discover the ideal statistical model for your data automatically, guided by a synthesis of diverse metrics. DataSafari's Inferential Predictor suite is built for depth and adaptability, offering extensive customization options from metric prioritization to model selection.

Transform

The Swiss Army Knife of Data Transformation.

Transform and refine your data with powerful tools that clean, encode, and engineer features for both numerical and categorical data. DataSafari's Transformer suite enhances data quality, ensuring your inputs are primed for high-level analysis and machine learning.
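
For readers new to these steps, the sketch below shows two common transformations, standardizing a numerical column and one-hot encoding a categorical one, using plain pandas. It does not use DataSafari's Transformer suite, and the column names are hypothetical.

```python
import pandas as pd

# Hypothetical sample data
df = pd.DataFrame({
    "Income": [30_000.0, 45_000.0, 60_000.0],
    "Segment": ["A", "B", "A"],
})

# Standardize the numerical column to zero mean and unit variance
income = df["Income"]
df["Income_z"] = (income - income.mean()) / income.std(ddof=0)

# One-hot encode the categorical column into 0/1 indicator columns
encoded = pd.get_dummies(df, columns=["Segment"], dtype=int)
print(encoded)
```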

Build ML Models

Preprocessing, Tuning & Model Evaluation. Seamlessly.

Streamline the creation and evaluation of machine learning models with tools that handle everything from preprocessing to tuning. Just specify your variables, and DataSafari's ML Predictor suite will deliver the best models, tuned and ready for deployment.
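
The workflow described above (preprocess, tune, evaluate) can be sketched by hand with scikit-learn. This is an illustration of the general technique under assumed synthetic data, not DataSafari's ML Predictor suite or its defaults.

```python
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import GridSearchCV
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic regression data (hypothetical)
rng = np.random.default_rng(0)
X = rng.normal(size=(120, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + rng.normal(scale=0.1, size=120)

# Keep preprocessing and model in one pipeline, so the scaler
# is fit only on the training folds during cross-validation
pipeline = Pipeline([
    ("scale", StandardScaler()),
    ("model", Ridge()),
])

# Tune the regularization strength with 5-fold cross-validation
param_grid = {"model__alpha": [0.01, 0.1, 1.0, 10.0]}
search = GridSearchCV(pipeline, param_grid, cv=5)
search.fit(X, y)

# best_score_ is the mean cross-validated R^2 of the best setting
print(search.best_params_, round(search.best_score_, 3))
```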

Evaluate

Model Assumptions, Data Types & More.

Effortlessly verify critical statistical assumptions like equal variance and normality. DataSafari's Evaluators provide detailed analyses and interpretations, enabling you to integrate these insights seamlessly into your data-driven decision-making processes.

Get Started

Installation

DataSafari is available on PyPI and can be installed using pip or Poetry.

1. Install DataSafari in your terminal with pip or Poetry.

2. Import DataSafari in your Python script, for example: from datasafari.predictor import predict_hypothesis

Usage

For extensive usage examples and instructions, visit DataSafari's documentation.

Hypothesis Testing with DataSafari

```python
from datasafari.predictor import predict_hypothesis
import pandas as pd
import numpy as np

# Create a sample DataFrame
df_hypothesis = pd.DataFrame({
    'Group': np.random.choice(['Control', 'Treatment'], size=100),
    'Score': np.random.normal(0, 1, 100)
})

# Perform hypothesis testing
results = predict_hypothesis(df_hypothesis, 'Group', 'Score')
```

Open Source

DataSafari is built on a foundation of transparency and is distributed under the GNU GPL v3 license.
