Home
Jake Dodd edited this page May 12, 2015 · 26 revisions
- Registering Metadata for Serialization: if your topology doesn't extend `ConfigurableTopology`, you will need to manually register storm-crawler's `Metadata` class for serialization in Storm.
- Protocols: network protocols that can be used in storm-crawler
- SiteMapParserBolt: how to handle sitemaps
- JSoupParserBolt: how to parse HTML documents
- URLFilters: how to filter or normalise outlinks
- Configuration: how to configure storm-crawler
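
As a rough illustration of the Metadata registration mentioned above, the fragment below is a minimal sketch of how a class can be registered with Kryo through Storm's `topology.kryo.register` setting in a YAML configuration file. The fully qualified class name is an assumption: the package has changed between storm-crawler releases, so check the one that matches your version.

```yaml
# Sketch only: register storm-crawler's Metadata class with Kryo.
# The package name below is an assumption; adjust it to your storm-crawler version.
topology.kryo.register:
  - com.digitalpebble.storm.crawler.Metadata
```

The same registration can be done in code when building the topology, for example with Storm's `Config.registerSerialization(conf, Metadata.class)`.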