• Stars
    star
    101
  • Rank 338,166 (Top 7 %)
  • Language
    Java
  • License
    GNU Lesser Genera...
  • Created over 12 years ago
  • Updated about 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Implementation of Vision Based Page Segmentation algorithm in Java

vips_java

Implementation of Vision Based Page Segmentation algorithm in Java.

The implementation utilizes CSSBox (X)HTML/CSS rendering engine written by Radek Burget.

Description of VIPS and my implementation in my master's thesis (in Czech)

http://www.fit.vutbr.cz/study/DP/DP.php?id=14163&file=t

Original work by Microsoft

http://www.cad.zju.edu.cn/home/dengcai/VIPS/VIPS_July-2004.pdf

CSSBox

http://cssbox.sourceforge.net

Compilation

This project uses Apache Maven. Compile it by running mvn package and run java -cp target/vips-java-*jar-with-dependencies.jar org.fit.vips.VipsTester to start the VipsTester.

Usage

Just pass the URL of web page you want to analyze as argument to VipsTester class.

Preferences of implementation can be changed also there.