Apache Nutch

Apache Nutch is a highly extensible and scalable open source web crawler software project.

Apache Nutch
Original author(s)Doug Cutting, Mike Cafarella
Developer(s)Apache Software Foundation
Stable release
1.x1.19 / 22 August 2022 (2022-08-22)
2.x2.4 / 11 October 2019 (2019-10-11)
RepositoryNutch Repository
Written inJava
Operating systemCross-platform
TypeWeb crawler
LicenseApache License 2.0
Websitenutch.apache.org
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.