Open-source, low-code AutoML platform for Python. PyCaret 4.0: sklearn-native engine + React control plane.
-
Updated
Apr 29, 2026 - Python
Open-source, low-code AutoML platform for Python. PyCaret 4.0: sklearn-native engine + React control plane.
🍊 📊 💡 Orange: Interactive data analysis
🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
MTEB: Massive Text Embedding Benchmark
A curated list of community detection research papers with implementations.
A Python toolkit/library for reality-centric machine/deep learning & data mining on partially-observed time series, with 50+ SOTA neural network models for scientific analysis tasks (imputation, classification, clustering, forecasting, anomaly detection, cleaning) on incomplete industrial irregularly-sampled multivariate TS with NaN missing values
Nomic Developer API SDK
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
🔴 MiniSom is a minimalistic implementation of the Self Organizing Maps
SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Time series distances: Dynamic Time Warping (fast DTW implementation in C)
pyclustering is a Python, C++ data mining library.
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
Awesome Deep Graph Clustering is a collection of SOTA, novel deep graph clustering methods (papers, codes, and datasets).
A scikit-learn based module for multi-label et. al. classification
A PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).
Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
A robust streaming log template miner based on the Drain algorithm
Add a description, image, and links to the clustering topic page so that developers can more easily learn about it.
To associate your repository with the clustering topic, visit your repo's landing page and select "manage topics."