The open-source tool for building high-quality datasets
Running large language models on a single GPU
Python library for portfolio optimization built on top of scikit-learn
A reactive notebook for Python
Probabilistic reasoning and statistical analysis in TensorFlow
Python Stream Processing
DeepVariant is an analysis pipeline that uses a deep neural network
150+ quantitative finance Python programs
Evaluate and monitor ML models from validation to production
The fastest way to build data pipelines
Uncover insights, surface problems, monitor, and fine-tune your LLM
Python Client for Supabase. Query Postgres from Flask, Django
AI discovers 520,000 stable inorganic crystal structures for research
Helps data scientists define testable self-documenting dataflows
Label Studio is a multi-type data labeling and annotation tool
A modular, primitive-first, Python-first PyTorch library
Detecting silent model failure. NannyML estimates performance
A unified framework for machine learning with time series
DoWhy is a Python library for causal inference
Extensible, parallel implementations of t-SNE
Machine learning tutorials (mainly in Python 3)
A high performance implementation of HDBSCAN clustering
MLOps simplified. From ML Pipeline ⇨ Data Product without the hassle
A curated list of data mining papers about fraud detection
Training data (data labeling, annotation, workflow) for all data types