Archives for 2015

Norconex HTTP Collector 2.3.0 released

Norconex is proud to release version 2.3.0 of its Norconex HTTP Collector open-source web crawler.  Thanks to incredible community feedback and efforts, we have implemented several feature requests, and your favorite crawler is now more stable than ever. The following ... Read More...


Posted on November 6, 2015 in Latest Releases


Norconex Importer 2.4.0 released

Norconex is proud to release version 2.4.0 of its Norconex Importer open-source product.  In addition to the usual bug fixes and stability enhancements, this release provides more possibilities for parsing and enriching your documents.  Most significantly, Importer 2.4.0 allows for ... Read More...


Posted on November 2, 2015 in Latest Releases


Lucene/Solr Revolution 2015 Highlights

In collaboration with Pascal Dimassimo. This year’s conference was held in Austin, Texas on October 15-16, 2015. It gathered around 600 Lucene and Solr enthusiasts from 26 countries, including many ... Read More...


Posted on October 26, 2015 in Latest Events


Norconex HTTP Collector 2.2.0 Now Available

The latest release of Norconex HTTP Collector provides more content transformation capabilities, canonical URL support, increased stability, and additional features. As the Internet grows, so does the demand for better ways to extract and process web data. Several ... Read More...


Posted on July 22, 2015 in Latest Releases


Norconex Importer 2.2.0 released

This release of Norconex Importer brings many fixes, increased stability, and nice new features. The following highlights some of the additions with XML configuration or Java code samples. Retrieve a document length: Thanks to the new DocumentLengthTagger, you can ... Read More...


Posted on June 15, 2015 in Latest Releases


Norconex, proud sponsor of girl soccer teams

In the wake of FIFA Women’s World Cup 2015 starting in Canada on June 6th, many young girls are anxiously waiting to see their favorite players compete for their country. Soccer (or “football” outside North America) has been the ... Read More...


Posted on June 3, 2015 in Latest Events


Use Solr 5 with Docker

Docker is all the rage at the moment! It was recently selected as a Gartner Cool Vendor in DevOps. As you may already know, Docker is a platform to build and deploy applications as self-contained units. Those units, called containers, can ... Read More...


Posted on May 1, 2015 in Latest Articles


Data Mining with Solr 5 – How to Slice and Dice Your Data With Facet Pivot and the Stats Module

Introduction: You already know that Solr is a great search application, but did you know that Solr 5 could be used as a platform to slice and dice your data? With Pivot Facet working hand in hand with the Stats Module, ... Read More...


Posted on April 9, 2015 in Latest Articles


New release of Norconex Collectors

Optical character recognition (OCR), content translation, title generation, and detection and text extraction from more file formats are among the new features now part of your favorite crawlers: Norconex HTTP Collector 2.1.0 and Norconex Filesystem Collector 2.1.0. They are both available ... Read More...


Posted on April 8, 2015 in Latest Releases


Norconex Importer 2.1.0 released

This feature release of Norconex Importer brings bug fixes, enhancements, and great new features, such as OCR and translation support.  Keep reading for all the details on some of this release’s most interesting changes. While Java can be used to ... Read More...


Posted on April 1, 2015 in Latest Releases

