Spotify currently holds 242 public repositories out of which 19 are related to data science and machine learning.
Name | Description | Language | Stars | License |
---|---|---|---|---|
tfreader | TensorFlow TFRecord reader CLI tool | Scala | 11 | Apache License 2.0 |
flytekit-java | Java/Scala library for easily authoring Flyte tasks and workflows | Java | 10 | Apache License 2.0 |
Name | Description | Language | Stars | License |
---|---|---|---|---|
luigi | Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in. | Python | 13477 | Apache License 2.0 |
annoy | Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk | C++ | 7226 | Apache License 2.0 |
chartify | Python library that makes it easy for data scientists to create charts. | Python | 2658 | Apache License 2.0 |
scio | A Scala API for Apache Beam and Google Cloud Dataflow. | Scala | 1912 | Apache License 2.0 |
heroic | The Heroic Time Series Database | Java | 788 | Apache License 2.0 |
featran | A Scala feature transformation library for data science and machine learning | Scala | 365 | Apache License 2.0 |
big-data-rosetta-code | Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code | Scala | 222 | Apache License 2.0 |
pythonflow | 🐍 Dataflow programming for python. | Python | 192 | Apache License 2.0 |
styx | "The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes. | Java | 180 | Apache License 2.0 |
zoltar | Common library for serving TensorFlow, XGBoost and scikit-learn models in production. | Java | 110 | Apache License 2.0 |
spotify-tensorflow | Provides Spotify-specific TensorFlow helpers | Python | 105 | Apache License 2.0 |
noether | Scala Aggregators used for ML Model metrics monitoring | Scala | 63 | Apache License 2.0 |
pyschema | Python library for class-based schema definition, object serialization and data validation | Python | 56 | Apache License 2.0 |
magnolify | A collection of Magnolia add-on modules | Scala | 38 | Apache License 2.0 |
tfexample-derive | Provides compile-time derivation of conversions between Scala case classes and Tensorflow Example protocol buffers | Scala | 9 | Apache License 2.0 |
scio-contrib | Community-supported add-ons for Scio | Scala | 7 | Apache License 2.0 |
limbo | N/A | Scala | 5 | Apache License 2.0 |