Parse the NETSCAPE-Bookmark-file-1 format bookmarks exported by the browser, convert it into a tree structure, and also convert the tree structure back to bookmarks.

HTML 13 2 Updated Sep 26, 2024

zapolnoch / node-tesseract-ocr

A Node.js wrapper for the Tesseract OCR API

JavaScript 309 38 Updated Jul 13, 2023

biolab / orange3

🍊 📊 💡 Orange: Interactive data analysis

Python 4,934 1,026 Updated Dec 23, 2024

kedro-org / kedro

Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…

Python 10,071 910 Updated Dec 24, 2024

ploomber / ploomber

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

Python 3,533 237 Updated Sep 18, 2024

mara / mara-pipelines

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

Python 2,084 103 Updated Dec 15, 2023

EntilZha / PyFunctional

Python library for creating data pipelines with chain functional programming

Python 2,408 133 Updated Jun 21, 2024

axa-group / Parsr

Transforms PDF, Documents and Images into Enriched Structured Data

JavaScript 5,882 310 Updated Dec 3, 2023

mozilla / pdf.js

PDF Reader in JavaScript

JavaScript 49,138 10,089 Updated Dec 29, 2024

marceloprates / prettymaps

A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.

Jupyter Notebook 11,398 538 Updated Jul 6, 2024

nisaacson / pdf-extract

Node PDF Extract

JavaScript 385 76 Updated Aug 14, 2023

castorini / hedwig

PyTorch deep learning models for document classification

Python 595 126 Updated Jul 21, 2023

gnu-octave / octave

GNU Octave Mirror (https://www.octave.org/hg/octave). Report bugs and submit pull requests (patches) at https://bugs.octave.org

C++ 422 59 Updated Dec 29, 2024

airbytehq / airbyte

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 16,588 4,214 Updated Dec 29, 2024

Nike-Inc / koheesio

Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.

Python 613 28 Updated Dec 20, 2024

pola-rs / polars

Dataframes powered by a multithreaded, vectorized query engine, written in Rust

Rust 31,142 2,027 Updated Dec 29, 2024

pandas-dev / pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Python 44,128 18,095 Updated Dec 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eric U1F30C

Achievements

Achievements

Block or report U1F30C

📊 Data utilities

qgis / QGIS

mbloch / mapshaper

qminer / qminer

plotly / plotly-nodejs

Waikato / weka-3.8

tensorflow / tfjs

manuelbieh / geolib

stackgl / headless-gl

Turfjs / turf

ariya / phantomjs

NaturalIntelligence / fast-xml-parser

cheeriojs / cheerio

evansiroky / node-geo-tz

zmyjs / netscape-bookmark-tree