π Data utilities
QGIS is a free, open source, cross platform (lin/win/mac) geographical information system (GIS)
Tools for editing Shapefile, GeoJSON, TopoJSON and CSV files
Analytic platform for real-time large-scale streams containing structured and unstructured data.
node.js wrapper for Plotly's Chart Studio Streaming and REST APIs
No longer updated mirror of the Weka 3.8 branch.
A WebGL accelerated JavaScript library for training and deploying ML models.
Zero dependency library to provide some basic geo functions
A modular geospatial engine written in JavaScript and TypeScript
Validate XML, Parse XML and Build XML rapidly without C/C++ based libraries and no callback.
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
A node.js module to find the timezone based on gps coordinates
Parse the NETSCAPE-Bookmark-file-1 format bookmarks exported by the browser, convert it into a tree structure, and also convert the tree structure back to bookmarks.
A Node.js wrapper for the Tesseract OCR API
π π π‘ Orange: Interactive data analysis
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,β¦
The fastest β‘οΈ way to build data pipelines. Develop iteratively, deploy anywhere. βοΈ
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
Python library for creating data pipelines with chain functional programming
Transforms PDF, Documents and Images into Enriched Structured Data
A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.
PyTorch deep learning models for document classification
GNU Octave Mirror (https://www.octave.org/hg/octave). Report bugs and submit pull requests (patches) at https://bugs.octave.org
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
Dataframes powered by a multithreaded, vectorized query engine, written in Rust
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more