A Python library and command-line tool that quickly and automatically generates BibTeX data from the PDF file of a scientific publication.

Python 69 9 Updated Aug 9, 2024

The NORN Poems corpus consists of 3,440 poems published in the 1890s in Norwegian and Danish and encoded in TEI.

1 Updated Oct 15, 2024

This is a selected subset of the Gutenberg corpus.

Python 1 Updated Jan 23, 2025

Visual Studio Code extension for PoeTree

1 Updated Aug 29, 2024

A repository of datasets paired with rich documentation, data essays, and teaching resources

HTML 70 2 Updated Jan 24, 2025

Tracing variation in poetic metres via local sequence alignment

Jupyter Notebook 4 1 Updated Jan 24, 2025

Incomplete, unofficial documentation for speedrun.com's new API as used by the web interface

14 3 Updated Jan 7, 2025

Jupyter Notebook 1 Updated Oct 16, 2023

idiolect: An R package for forensic authorship analysis

R 14 3 Updated Oct 5, 2024

Jupyter Notebook 5 1 Updated Nov 14, 2024

Talk rater model for CES 2021 conference

R 14 2 Updated Jun 12, 2021

Website for CHR2023

TeX 1 Updated Dec 6, 2023

A simple text reuse detection CLI tool.

Python 129 25 Updated Jun 17, 2024

Corpus of Hungarian poems in TEI XML with machine annotation

8 2 Updated Dec 11, 2024

Python package russtress: adds stress marks to Russian text

Python 50 11 Updated May 13, 2020

Python 6 1 Updated Jun 6, 2021

HTML 4 1 Updated Sep 21, 2024

Text Re-use Alignment Visualization

JavaScript 38 9 Updated Nov 8, 2017

Lexical Simplification with Pretrained Encoders

Python 70 26 Updated Feb 5, 2021

Simple, collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Spanish poetry.

Python 29 4 Updated Nov 20, 2021

Thesis RMA Dutch literature and culture at Utrecht University (July 2019)

Jupyter Notebook 1 Updated Apr 8, 2022

Collection of songs from the Dutch Song Database of the Meertens Institute

1 Updated May 28, 2019

Corpus of classical Chinese poetry

HTML 22 15 Updated Sep 1, 2016

A great intro dataset for data exploration & visualization (alternative to iris).

R 919 216 Updated Sep 19, 2024

Jupyter Notebook 1 1 Updated Dec 8, 2022

A Simple Wolf RPG File Decrypter

C++ 261 49 Updated Nov 18, 2023