Data science

Python packages for data science

ⓘ  This is a selection of open source tools suggested by Mediafutures mentors for the 1st Open Call. Participants are free to use these or other tools.

Scrapy


Fast high-level web crawling & scraping framework for Python

NLTK


Suite of open source Python modules, data sets, and tutorials supporting research and development in Natural Language Processing
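
A minimal sketch of a few NLTK building blocks that need no extra data downloads: stemming, bigrams, and frequency counts. The sample sentence and the whitespace tokenization are assumptions made to keep the example self-contained; NLTK's own tokenizers require additional data packages.

```python
from nltk.probability import FreqDist
from nltk.stem import PorterStemmer
from nltk.util import ngrams

# Naive whitespace tokenization, chosen to avoid downloading tokenizer data.
tokens = "the cats are chasing the other cats".split()

# Reduce each token to its stem with the Porter algorithm.
stemmer = PorterStemmer()
stems = [stemmer.stem(token) for token in tokens]

# Count token frequencies and build adjacent word pairs.
freq = FreqDist(tokens)
bigrams = list(ngrams(tokens, 2))

print(stems)
print(freq["cats"], bigrams[0])
```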

Gensim


Python library for topic modelling, document indexing and similarity retrieval with large corpora

scikit-learn


Python module for machine learning, built on top of SciPy and distributed under the 3-Clause BSD license
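
A minimal sketch of the usual scikit-learn workflow (split, fit, score), using the bundled iris dataset. The choice of classifier and parameters is an assumption for illustration, not a recommendation.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Load a small built-in dataset: 150 flowers, 4 features, 3 classes.
X, y = load_iris(return_X_y=True)

# Hold out a quarter of the data for evaluation, keeping class proportions.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Fit a simple classifier and measure accuracy on the held-out data.
clf = LogisticRegression(max_iter=1000)
clf.fit(X_train, y_train)
accuracy = clf.score(X_test, y_test)

print(f"test accuracy: {accuracy:.2f}")
```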

NetworkX


Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks
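
As a brief sketch, the example below builds a small undirected graph and queries it. The node names and edges are invented for the example.

```python
import networkx as nx

# Build a small undirected graph from an edge list (hypothetical data).
G = nx.Graph()
G.add_edges_from([("a", "b"), ("b", "c"), ("c", "d"), ("a", "c")])

# Structure: the shortest path between two nodes, and each node's degree.
path = nx.shortest_path(G, source="a", target="d")
degrees = dict(G.degree())

# A simple centrality measure: degree divided by the maximum possible degree.
centrality = nx.degree_centrality(G)

print(path)
print(degrees)
```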

Plotly


Interactive graphing library for Python

Matplotlib


Comprehensive library for creating static, animated, and interactive visualizations in Python
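
A short sketch of a static plot with Matplotlib, rendered to an in-memory PNG so it runs without a display. The non-interactive `Agg` backend and the sample data are assumptions made for the example.

```python
import io

import matplotlib

matplotlib.use("Agg")  # Non-interactive backend; no window is opened.
import matplotlib.pyplot as plt

# Plot a few sample points with axis labels and a legend.
fig, ax = plt.subplots()
ax.plot([0, 1, 2, 3], [0, 1, 4, 9], marker="o", label="squares")
ax.set_xlabel("x")
ax.set_ylabel("y")
ax.legend()

# Render the figure to PNG bytes in memory instead of writing a file.
buffer = io.BytesIO()
fig.savefig(buffer, format="png")
png_bytes = buffer.getvalue()
plt.close(fig)

print(len(png_bytes) > 0)
```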