Posts tagged ‘java’

Norconex HTTP Collector 2.8.0 released

Norconex is proud to announce the release of Norconex HTTP Collector version 2.8.0.  This release is accompanied by new releases of many related Norconex open-source products (Filesystem Collector, Importer, Committers, etc.), and together they bring dozens of new features and ... Read More...


Posted on November 26, 2017 by in Latest Releases


Norconex HTTP and Filesystem Collector 2.7.0 released

Norconex released version 2.7.0 of both its HTTP Collector and Filesystem Collector.  This update, along with related component updates, introduces several interesting features. HTTP Collector changes The following items are specific to the HTTP Collector.  For changes applying to both the ... Read More...


Posted on April 26, 2017 by in Latest Releases


Norconex Importer 2.4.0 released

Norconex is proud to release version 2.4.0 of its Norconex Importer open-source product.  In addition to the usual bug fixes and stability enhancements, this release provides more possibilities for parsing and enriching your documents.  Most significantly, Importer 2.4.0 allows for ... Read More...


Posted on November 2, 2015 by in Latest Releases


Norconex Importer 2.2.0 released

This release of Norconex Importer brings many fixes, increased stability, and nice new features. The following highlights some of the additions with XML configuration or Java code samples. Retrieve a document Length [ezcol_1half] Thanks to the new DocumentLengthTagger, you can ... Read More...


Posted on June 15, 2015 by in Latest Releases


Data Mining with Solr 5 – How to Slice and Dice Your Data With Facet Pivot and the Stats Module

Introduction You already know that Solr is a great search application, but did you know that Solr 5 could be used as a platform to slice and dice your data?  With Pivot Facet working hand in hand with Stats Module, ... Read More...


Posted on April 9, 2015 by in Latest Articles


Norconex Importer 2.1.0 released

This feature release of Norconex Importer brings bug fixes, enhancements, and great new features, such as OCR and translation support.  Keep reading for all the details on some of this release’s most interesting changes. While Java can be used to ... Read More...


Posted on April 1, 2015 by in Latest Releases


Norconex Commons Lang 1.6.0 Released

Release 1.6.0 of Norconex Commons Lang provides new Java utility classes and enhancements to existing ones: New Classes TimeIdGenerator [ezcol_1half] Use TimeIdGenerator when you need to generate numeric IDs that are unique within a JVM. It generates Java long values ... Read More...


Posted on March 27, 2015 by in Latest Releases


Create a website broken links checker

This tutorial will show you how to extend Norconex HTTP Collector using Java to create a link checker to ensure all URLs in your web pages are valid. The link checker will crawl your target site(s) and create a report ... Read More...


Posted on February 10, 2015 by in Latest Articles


Major upgrades to Norconex crawlers

Norconex just released major upgrades to all its Norconex Collectors and related projects.  That is, Norconex HTTP Collector and Norconex Filesystem Collector, along with the Norconex Importer module and all available committers (Solr, Elasticsearch, HP IDOL, etc), were all upgraded ... Read More...


Posted on November 27, 2014 by in Latest Releases


Norconex Announces Availability of Norconex Filesystem Collector

GATINEAU, QC, CANADA – Thursday, August 25, 2014 – Norconex is announcing the launch of Norconex Filesystem Collector, providing organizations with a free “universal” filesystem crawler. The Norconex Filesystem Collector enables document indexing into target repositories of choice, such as ... Read More...


Posted on August 25, 2014 by in Latest Releases