Archives for the ‘Latest Articles’ category

How to crawl Facebook

Despite all the “noise” on social media sites, we can’t deny how valuable information found on social media networks can be for some organizations. Somewhat less obvious is how to harvest that information for your own use. You can find ... Read More...


Posted on February 5, 2015 in Latest Articles


Monitor your crawler’s progress with JEF Monitor

In large environments, it’s common to have many crawlers running at once, or at scheduled intervals, to keep your collected content up to date. For example, this is a typical requirement of search engine installations. They need their internal indices ... Read More...


Posted on January 7, 2015 in Latest Articles


Facets with Lucene

During the development of our latest product, Norconex Content Analytics, we decided to add facets to the search interface. They allow for exploring the indexed content easily. Solr and Elasticsearch both have facet implementations that work on top of Lucene. ... Read More...


Posted on August 1, 2014 in Latest Articles


Google Search Appliance (GSA): A journey into an accessible responsive web design

As a Search Expert at Norconex, I am often assigned the task of integrating web accessibility standards within a search user interface for our customers in the government sector; these customers look to web accessibility to improve the overall search ... Read More...


Posted on February 26, 2014 in Latest Articles


An Open-Source Crawler for Autonomy IDOL

HP Autonomy users, take control of your web crawling. Norconex recently released an HP Autonomy IDOL Committer module for its open-source web crawler, Norconex HTTP Collector. You can now enjoy the features of the Norconex crawler and experience the freedom of ... Read More...


Posted on January 15, 2014 in Latest Articles


Upgrading code to Lucene 4

For a client last year, we had to upgrade some old Lucene code to Lucene 4. Lucene 4 was a rather large release, and there are many aspects to be aware of when upgrading non-trivial code. Let’s take a look ... Read More...


Posted on January 13, 2014 in Latest Articles


A true winner!

Norconex is glad to help Sophie Carrier-Laforte, an outstanding amateur athlete who is again targeting the biggest honors this year, go further. I have had the privilege of knowing Sophie for a long while now, and I have seen her progression as an athlete. ... Read More...


Posted on November 24, 2013 in Latest Articles


Serving autocomplete suggestions fast!

Autocomplete (also known as live suggestions or search suggestions) is very popular in search applications. It is generally used either to return query suggestions (à la Google Autocomplete) or to propose existing search results (à la Facebook). Open source search ... Read More...


Posted on October 9, 2013 in Latest Articles


Exploring Norconex Commons Lang

Norconex Commons Lang is a generic Java library providing useful utility classes that extend the base Java API. Its name is shamelessly borrowed from Apache Commons Lang, so people can quickly guess what it’s about from the name alone. It ... Read More...


Posted on September 19, 2013 in Latest Articles


Using Norconex HTTP Collector with LucidWorks

During a recent client project, I was required to crawl several websites with specific requirements for each. For example, one of the websites required: that the content of a meta tag be used as a replacement for the actual URL, ... Read More...


Posted on August 28, 2013 in Latest Articles

