Sponsored by

lucavallin
Sponsor?

About

First Issue curates accessible issues from popular open-source projects, and helps you make your next contribution to open-source.

Join the Newsletter

Join "The lucavallin Newsletter" to receive curated issues from FirstIssue and other articles in your inbox every other week.

Sort Repositories

Data-Centric Pipelines and Data Versioning
lang: Go
stars: 6K
last activity:
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
lang: Python
stars: 39.9K
last activity:
Apache Superset is a Data Visualization and Data Exploration Platform
lang: TypeScript
stars: 54.6K
last activity:
matplotlib: plotting with Python
lang: Python
stars: 18.2K
last activity:
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
lang: Python
stars: 3.6K
last activity:
Python Data. Leaflet.js Maps.
lang: Python
stars: 6.4K
last activity:
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
lang: Go
stars: 3.9K
last activity:
Statsmodels: statistical modeling and econometrics in Python
lang: Python
stars: 8.9K
last activity:
Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
lang: Python
stars: 13K
last activity:
iterative / dvc
🦉 Data Version Control | Git for Data & Models | ML Experiments Management
lang: Python
stars: 12.1K
last activity:
STUMPY is a powerful and scalable Python library for modern time series analysis
lang: Python
stars: 2.8K
last activity:
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba 来自阿里巴巴的一站式大规模图计算系统 图分析 图查询 图机器学习
lang: Rust
stars: 2.9K
last activity:
microsoft / nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
lang: Python
stars: 13.3K
last activity:
10 Weeks, 20 Lessons, Data Science for All!
lang: Jupyter Notebook
stars: 22.7K
last activity:
OpenRefine is a free, open source power tool for working with messy data and improving it
lang: Java
stars: 9.7K
last activity: