18 changes: 18 additions & 0 deletions .github/workflows/check-links.yml
@@ -0,0 +1,18 @@
name: Check for broken links
on:
  push:
    branches: [ main, master ]
  pull_request:
    branches: [ main, master ]
  schedule:
    - cron: '0 9 * * 1'
jobs:
  check-links:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Link Checker
        uses: lycheeverse/lychee-action@v1.8.0
        with:
          args: --verbose --max-retries 3 --timeout 10 '**/*.html' '**/*.md'
          fail: true
12 changes: 12 additions & 0 deletions CITATION.cff
@@ -0,0 +1,12 @@
cff-version: 1.2.0
message: "If you use this software or dataset, please cite it as below."
type: software
authors:
  - family-names: "de la Serna"
    given-names: "Juan Moisés"
    orcid: "https://orcid.org/0000-0002-8401-8018"
    affiliation: "Universidad Internacional de La Rioja (UNIR)"
title: "awesome-python-data-science"
version: "1.0.0"
date-released: "2026-04-06"
url: "https://github.com/juanmoisesd/awesome-python-data-science"
10 changes: 10 additions & 0 deletions README.md
@@ -487,6 +487,7 @@
* [cleanlab](https://github.com/cleanlab/cleanlab) - The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
* [snorkel](https://github.com/snorkel-team/snorkel) - A system for quickly generating training data with weak supervision.
* [dataprep](https://github.com/sfu-db/dataprep) - Collect, clean, and visualize your data in Python with a few lines of code.
* [latamdata-py](https://github.com/juanmoisesd/latamdata-py) - One-line access to 38 open research datasets from Latin America (health, neuroscience, mental health, economics). Real-world data for research and ML.

### Synthetic Data

@@ -565,3 +566,12 @@ Read the <a href=https://github.com/krzjoa/awesome-python-datascience/blob/maste

## License
This work is licensed under the Creative Commons Attribution 4.0 International License - [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)

## How to Cite

If you use this repository in your research, please cite:

> de la Serna, J. M. (2026). *Awesome Python Data Science*. Universidad Internacional de La Rioja (UNIR).
> https://github.com/juanmoisesd/awesome-python-data-science

See `CITATION.cff` for formatted references.