Posts tagged ‘Importer’

Norconex HTTP Collector 2.8.0 released

Norconex is proud to announce the release of Norconex HTTP Collector version 2.8.0.  This release is accompanied by new releases of many related Norconex open-source products (Filesystem Collector, Importer, Committers, etc.), and together they bring dozens of new features and ... Read More...


Posted on November 26, 2017 in Latest Releases


Norconex HTTP and Filesystem Collector 2.7.0 released

Norconex released version 2.7.0 of both its HTTP Collector and Filesystem Collector. This update, along with related component updates, introduces several interesting features. HTTP Collector changes: The following items are specific to the HTTP Collector. For changes applying to both the ... Read More...


Posted on April 26, 2017 in Latest Releases


Norconex HTTP Collector 2.6.0 released

Norconex has released version 2.6.0 of its HTTP Collector web crawler! Among new features, an upgrade of its Importer module brings new document parsing and manipulating capabilities. Some of the changes highlighted here also benefit the Norconex Filesystem Collector. New ... Read More...


Posted on August 25, 2016 in Latest Releases


Norconex Importer 2.4.0 released

Norconex is proud to release version 2.4.0 of its Norconex Importer open-source product.  In addition to the usual bug fixes and stability enhancements, this release provides more possibilities for parsing and enriching your documents.  Most significantly, Importer 2.4.0 allows for ... Read More...


Posted on November 2, 2015 in Latest Releases


Norconex Importer 2.2.0 released

This release of Norconex Importer brings many fixes, increased stability, and nice new features. The following highlights some of the additions with XML configuration or Java code samples. Retrieve a document Length: Thanks to the new DocumentLengthTagger, you can ... Read More...


Posted on June 15, 2015 in Latest Releases


Norconex Importer 2.1.0 released

This feature release of Norconex Importer brings bug fixes, enhancements, and great new features, such as OCR and translation support.  Keep reading for all the details on some of this release’s most interesting changes. While Java can be used to ... Read More...


Posted on April 1, 2015 in Latest Releases


Major upgrades to Norconex crawlers

Norconex just released major upgrades to all its Collectors and related projects. That is, Norconex HTTP Collector and Norconex Filesystem Collector, along with the Norconex Importer module and all available committers (Solr, Elasticsearch, HP IDOL, etc.), were all upgraded ... Read More...


Posted on November 27, 2014 in Latest Releases


Norconex Importer 1.3.0 Released

Release 1.3.0 of Norconex Importer is now available. Release overview: Now stores the content “family” for each document as “importer.contentFamily”. New SplitTagger: splits values into multiple values using a separator of choice. New CopyTagger: copies document metadata fields to other fields. ... Read More...


Posted on August 19, 2014 in Latest Releases


Norconex Importer 1.2.0 Released

Norconex Importer 1.2.0 was just released along with a new website for it. New features: Now supports text extraction from WordPerfect documents. New transformer to reduce consecutive instances of the same string to only one instance. New transformer to perform ... Read More...


Posted on March 9, 2014 in Latest Releases