• Stars
    star
    16
  • Rank 1,311,288 (Top 26 %)
  • Language
    Java
  • Created over 11 years ago
  • Updated over 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Java implementation of the Internet Research Lab Web Crawler (IRLbot) as presented by Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, and Dmitri Loguinov in their paper "IRLbot: Scaling to 6 Billion Pages and Beyond"

More Repositories

1

spring-security-acl-mongodb

Spring Security MongoDB based access control list (ACL) implementation
Java
30
star
2

ContextExtraction

Online news article (HTML pages) context extraction using Maximum Subsequence Segmentation Algorithm as presented by Pasternack and Roth
Java
16
star
3

JDiff

Java implementation of Myers Diff algorithm, based on a port from the C# implementation done by Nicholas Butler at http://simplygenius.net/Article/DiffTutorial1 or http://www.codeproject.com/Articles/42279/Investigating-Myers-diff-algorithm-Part-1-of-2
Java
15
star
4

ts-edifact

Typescript port of the node-edifact project
TypeScript
13
star
5

camel-rest-dsl-with-spring-security

Sample application on using Apache Camel with its REST DSL feature in combination with Spring Security
Java
12
star
6

JDrum

Java implementation of the disk repository with update management (DRUM) framework as presented by Hsin-Tsang Lee, Derek Leonard, Xiaoming Wang, and Dmitri Loguinov in the paper "IRLbot: Scaling to 6 Billion Pages and Beyond"
Java
7
star
7

CamelCxfJetty

Test project for the usage of Apache Camel's routing functionality in combination with an Apache CXF managed service on top of a Jetty configured server
Java
4
star
8

Classifier

Is a Java classification framework currently supporting Naive Bayes and a libSVM port as well as a fastC45 port
Java
3
star
9

JNAFileInfo

Sample on how to use JNA with FileVersion to gain information on data stored in executable files. This will only work on Windows 2k and newer versions
Java
3
star
10

PluginApplication

A basic Java plugin architecture with dependency injection and singleton support. The plugin application avoids file-locking
Java
3
star
11

RuleBasedEngine

Simple rule-based enginde done in Java
Java
3
star
12

Common

Contains classes and functions used by a couple of internal frameworks
Java
1
star
13

PorterStemmer

Implementation of Porter's English stemmer algorighm
Java
1
star
14

spring-security-samples-acl-mongodb

Customized Spring Security ACL contact sample which works with a MongoDB based ACL service
Groovy
1
star
15

University

Projects created for university courses
Java
1
star
16

Parser

Simple HTML Parser Framework
Java
1
star
17

CamelMultipleJettyComponents

Showcases a simple Apache Camel setup which uses multiple Jetty components to build a REST based service
Java
1
star
18

CxfWsPojoTest

Simple test-project for POJO support of Camel's CXF extension
Java
1
star