Skip to content

boilerpipe 1.2.2 - a fork from 1.2.0 with additional features

License

Notifications You must be signed in to change notification settings

dbichko/boilerpipe

Repository files navigation

#Changes in this Version of Boilerpipe# This is an extended version of Boilerpipe.

This are the new features:

  • Media extraction (Youtube videos, Vimeo videos and images)
  • Extract an article with his HTML structure

Classes added:

  • de.l3s.boilerpipe.document.Media
  • de.l3s.boilerpipe.document.Video
  • de.l3s.boilerpipe.document.VimeoVideo
  • de.l3s.boilerpipe.document.YoutubeVideo
  • de.l3s.boilerpipe.extractors.HtmlArticleExtractor - this Extractor extracts the article content with his basic HTML structure from the document
  • de.l3s.boilerpipe.sax.MediaExtractor - this Extractor returns a list of Medias that are contained in the documents article

files/folders removed (because of the change to maven):

  • /build.xml
  • /eclipse-build
  • /lib

About

boilerpipe 1.2.2 - a fork from 1.2.0 with additional features

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published