Posts tagged ‘HTTP Collector’

Use Norconex crawlers with Apache Kafka

Kafka users rejoice! You can now use Norconex open-source crawlers with Apache Kafka, thanks to the Norconex Apache Kafka Committer. We owe this contribution to Joseph Paulo Mantuano (Senior Developer at The Red Flag Group) and Dan Davis. The Norconex Collectors community keeps growing. We ... Read More...


Posted on July 23, 2019 in Latest Releases


An Open-Source Crawler for Google Cloud Search

Great news! There is now a Google Cloud Search Committer for Norconex crawlers! This addition to the Norconex Collector family should delight Google Cloud Search fans. They too can now enjoy the full-featured crawling capabilities offered by Norconex open-source crawlers. Since this ... Read More...


Posted on May 31, 2019 in Latest Releases


AWS Public Sector Summit 2019 in Ottawa

Amazon Web Services (AWS) and the Canadian Public Sector organized another excellent Public Sector Summit on May 15, 2019. AWS hosted the first such summit in Ottawa last year, but this year’s event attracted a much larger crowd. Thousands of ... Read More...


Posted on May 24, 2019 in Latest Events


Open-Source Crawlers for Neo4j

Norconex crawlers and the Neo4j graph database are now a love match! Neo4j is arguably the most popular graph database out there. Use Norconex crawlers to harvest relationships from websites and filesystems and feed them to your favorite graph engine. This was made ... Read More...


Posted on January 15, 2019 in Latest Releases


How to run Norconex Collector in Docker

Introduction: Docker is popular because it makes it easy to package and deliver programs. This article will show you how to run the Java-based, open-source crawler Norconex HTTP Collector and the Elasticsearch Committer in Docker to crawl a website and index ... Read More...


Posted on February 10, 2018 in Latest Articles


Norconex HTTP Collector 2.8.0 released

Norconex is proud to announce the release of Norconex HTTP Collector version 2.8.0. This release is accompanied by new releases of many related Norconex open-source products (Filesystem Collector, Importer, Committers, etc.), and together they bring dozens of new features and ... Read More...


Posted on November 26, 2017 in Latest Releases


Diagrams for Norconex Crawlers

Norconex just made it easier to understand the inner workings of its crawlers by creating clickable flow diagrams. Those diagrams are now available as part of both the Norconex HTTP Collector and Norconex Filesystem Collector websites. Clicking on a shape will ... Read More...


Posted on May 15, 2017 in Latest Articles


Indexing to an AWS CloudSearch Domain

Amazon Web Services (AWS) has been all the rage lately, used by many organizations, companies, and even individuals. This rise in popularity can be attributed to the sheer number of services provided by AWS, such as Elastic Compute (EC2), Elastic ... Read More...


Posted on May 4, 2017 in Latest Articles


Norconex HTTP and Filesystem Collector 2.7.0 released

Norconex released version 2.7.0 of both its HTTP Collector and Filesystem Collector. This update, along with related component updates, introduces several interesting features. HTTP Collector changes: The following items are specific to the HTTP Collector. For changes applying to both the ... Read More...


Posted on April 26, 2017 in Latest Releases


Norconex HTTP Collector 2.6.0 released

Norconex has released version 2.6.0 of its HTTP Collector web crawler! Among new features, an upgrade of its Importer module brings new document parsing and manipulating capabilities. Some of the changes highlighted here also benefit the Norconex Filesystem Collector. New ... Read More...


Posted on August 25, 2016 in Latest Releases

