How to Install and Uninstall libboilerpipe-java Package on Ubuntu 20.10 (Groovy Gorilla)

Last updated: October 06, 2024

1. Install "libboilerpipe-java" package

Follow the step-by-step instructions below to install libboilerpipe-java on Ubuntu 20.10 (Groovy Gorilla):

$ sudo apt update
$ sudo apt install libboilerpipe-java
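
To confirm that the installation succeeded, the package status can be queried with dpkg-query. The following is a minimal sketch; the helper name is_installed is hypothetical and is not part of apt or of the package itself.

```shell
#!/bin/sh
# is_installed is a hypothetical helper name, used only for illustration.
# It succeeds (exit 0) when dpkg reports the package as fully installed.
is_installed() {
    # ${Status} is a dpkg-query format field; it is quoted so the shell
    # does not try to expand it.
    status=$(dpkg-query -W -f='${Status}' "$1" 2>/dev/null) || return 1
    [ "$status" = "install ok installed" ]
}

# Usage: report whether libboilerpipe-java is present.
if is_installed libboilerpipe-java; then
    echo "libboilerpipe-java is installed"
else
    echo "libboilerpipe-java is not installed"
fi
```

The check relies on the exit status rather than on parsing human-readable output, so it can be reused in scripts.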

2. Uninstall "libboilerpipe-java" package

Run the commands below to uninstall libboilerpipe-java on Ubuntu 20.10 (Groovy Gorilla), clear obsolete files from the package cache, and remove dependencies that are no longer needed:

$ sudo apt remove libboilerpipe-java
$ sudo apt autoclean && sudo apt autoremove
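
Before removing anything, it can help to preview what the removal would do. The sketch below uses the -s (--simulate) option of apt-get, which prints the planned actions without changing the system; the helper name preview_removal is hypothetical.

```shell
#!/bin/sh
# preview_removal is a hypothetical helper name used for illustration.
# apt-get -s (--simulate) only prints the planned actions, so nothing
# is removed and root privileges are not required.
preview_removal() {
    apt-get -s remove "$1"
}

# Usage: show what removing libboilerpipe-java would do.
# The guard skips the call on systems without apt-get.
if command -v apt-get >/dev/null 2>&1; then
    preview_removal libboilerpipe-java || echo "simulation failed"
fi
```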

3. Information about the libboilerpipe-java package on Ubuntu 20.10 (Groovy Gorilla)

Package: libboilerpipe-java
Architecture: all
Version: 1.2.0-1
Priority: optional
Section: universe/java
Source: boilerpipe
Origin: Ubuntu
Maintainer: Ubuntu Developers
Original-Maintainer: Debian Java Maintainers
Bugs: https://bugs.launchpad.net/ubuntu/+filebug
Installed-Size: 166
Depends: libnekohtml-java, libxerces2-java
Filename: pool/universe/b/boilerpipe/libboilerpipe-java_1.2.0-1_all.deb
Size: 98704
MD5sum: 13305d090cdbad468c5754abbefef216
SHA1: 0eec2dd0a2d7ea8b1665cddcdfeb72f4b9b56a50
SHA256: ddc9d1af8e7b810fa052c4c4a649f51dd364258602c1b979b82038fc90c1c4ff
SHA512: 25e7f6de561d743225139eb63c3ededa8120f259eb813ec3d685b8f40717f3a9b0cfca272094ce4b6b47c3d5f1233786d25c5519a44cc00e32195c65de775647
Homepage: http://code.google.com/p/boilerpipe
Description-en: Boilerplate removal and fulltext extraction from HTML pages
 The boilerpipe library provides algorithms to detect and remove the surplus
 "clutter" (boilerplate, templates) around the main textual content of a web
 page.
 .
 The library already provides specific strategies for common tasks (for example:
 news article extraction) and may also be easily extended for individual problem
 settings.
 .
 Extracting content is very fast (milliseconds), just needs the input document
 (no global or site-level information required) and is usually quite accurate.
Description-md5: 8a9654f4b6579b9ec684e87231e38a2d
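
The SHA256 field in the record above can be used to check a manually downloaded copy of the .deb file. The sketch below assumes the file was fetched into the current directory, for example with "apt download libboilerpipe-java"; the helper name verify_sha256 is hypothetical.

```shell
#!/bin/sh
# verify_sha256 is a hypothetical helper: it compares a file's SHA-256
# digest with an expected value and succeeds only when they match.
verify_sha256() {
    file=$1
    expected=$2
    # sha256sum -c reads "<digest>  <filename>" lines (two spaces) from
    # stdin; --status suppresses output and reports via the exit code.
    printf '%s  %s\n' "$expected" "$file" | sha256sum -c --status -
}

# Usage (assumes the .deb is in the current directory):
deb=libboilerpipe-java_1.2.0-1_all.deb
sum=ddc9d1af8e7b810fa052c4c4a649f51dd364258602c1b979b82038fc90c1c4ff
if [ -f "$deb" ]; then
    if verify_sha256 "$deb" "$sum"; then
        echo "checksum OK"
    else
        echo "checksum MISMATCH"
    fi
fi
```

When the package is installed through apt, this comparison is performed automatically; the manual check is only relevant for files obtained some other way.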