Posts tagged ‘code’

Norconex Importer 2.1.0 released

This feature release of Norconex Importer brings bug fixes, enhancements, and great new features, such as OCR and translation support.  Keep reading for all the details on some of this release’s most interesting changes. While Java can be used to ... Read More...


Posted on April 1, 2015 by in Latest Releases


Norconex Commons Lang 1.6.0 Released

Release 1.6.0 of Norconex Commons Lang provides new Java utility classes and enhancements to existing ones: New Classes TimeIdGenerator [ezcol_1half] Use TimeIdGenerator when you need to generate numeric IDs that are unique within a JVM. It generates Java long values ... Read More...


Posted on March 27, 2015 by in Latest Releases


Create a website broken links checker

This tutorial will show you how to extend Norconex HTTP Collector using Java to create a link checker to ensure all URLs in your web pages are valid. The link checker will crawl your target site(s) and create a report ... Read More...


Posted on February 10, 2015 by in Latest Articles


How to crawl Facebook

Despite all the “noise” on social media sites, we can’t deny how valuable information found on social media networks can be for some organizations. Somewhat less obvious is how to harvest that information for your own use. You can find ... Read More...


Posted on February 5, 2015 by in Latest Articles


Norconex Commons Lang 1.5.0 Released

This feature release brings the following additions... Simple Pipeline Useful if you want to quickly assemble multiple tasks to be run into a single "pipeline" while keeping it ultra simple.  The following example does it all in a single class ... Read More...


Posted on November 24, 2014 by in Latest Releases


Facets with Lucene

During the development of our latest product, Norconex Content Analytics, we decided to add facets to the search interface. They allow for exploring the indexed content easily. Solr and Elasticsearch both have facet implementations that work on top of Lucene. ... Read More...


Posted on August 1, 2014 by in Latest Articles


Norconex Commons Lang 1.4.0 Released

Norconex Commons Lang 1.4.0 was just released. New features: New DataUnit classe to perform data unit (KB, MB, GB, etc) conversions much like Java TimeUnit class. New DataUnitFormatter to format any data unit ot a human-readable format taking into account ... Read More...


Posted on July 10, 2014 by in Latest Releases


Upgrading code to Lucene 4

For a client last year, we had to upgrade some old Lucene code to Lucene 4. Lucene 4 was a rather large release and there are many aspects to be aware when upgrading non trivial code. Let’s take a look ... Read More...


Posted on January 13, 2014 by in Latest Articles


Exploring Norconex Commons Lang

Norconex Commons Lang is a generic Java library providing useful utility classes that extend the base Java API.  Its name is shamelessly borrowed from Apache Commons Lang, so people can quickly assume what it’s about just by its name.   It ... Read More...


Posted on September 19, 2013 by in Latest Articles