Skip to content

An webhosted app (latex2wordcloud.herokuapp.com) providing text cleaning and analysis functionality for general text and LaTeX formatted text documents.

Notifications You must be signed in to change notification settings

walkenho/LaTeX2Wordcloud

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LaTeX 2 Wordcloud

A Streamlit app that allows the user to clean text data and visualize word frequencies.

The app offers several data cleaning steps and in the visualizes the most frequent words in the text using a bar chart.

As special feature, it uses the LaTeXStripper library to allows users to clean LaTeX formatted files from their formatting allowing the analysis of the actual content instead of formatting features ;)

It currently offers the following cleaning options:

  • Split Hyphenation
  • Lemmatization
  • Deletion of Stopwords
  • Deletion of Punctuation
  • Deletion of Single Characters
  • Deletion of LaTeX Formatting

A beta version is currently deployed under the following URL: https://latex2wordcloud.herokuapp.com/

About

An webhosted app (latex2wordcloud.herokuapp.com) providing text cleaning and analysis functionality for general text and LaTeX formatted text documents.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages