All Classes and Interfaces

Class
Description
The BuilderValidator class provides utility methods for validating builder parameters.
The Connection record represents a connection between a parent URI and a child URI, along with the depth of the connection.
This class contains constants used throughout the crawling process.
The DocumentParser interface defines methods for extracting relevant information from HTML documents.
The ErrorResultPage record represents a result page containing an error encountered during web crawling.
The ExportConfig interface defines methods for configuring data export options.
The ExportFileFormat enum represents the file formats supported for data export.
This utility class provides file-related operations.
The JsonExportConfig record represents the configuration for exporting data in JSON format.
A concrete implementation of the JWeaverCrawler.Builder interface used to configure and build instances of JWeaverCrawler.
Represents the JWeaverCrawler abstract class, which facilitates web crawling operations.
The Builder interface provides methods for building and customize an instance of JWeaverCrawler.
A concrete implementation of JWeaverCrawler providing web crawling functionality.
The JWeaverDocumentParser class is responsible for parsing HTML documents to extract relevant information.
The JWeaverExecutionException class represents an unchecked exception that occurs during the execution of a JWeaver task.
A concrete implementation of the JWeaverWriter interface for writing data to files.
Handles the crawling process for a base URI *
The JWeaverWriter interface defines methods for processing and writing the results of the web crawling process.
The MarkdownExportConfig record represents the configuration for exporting data in Markdown format.
The Metadata record represents metadata associated with a web page.
The NodeError record represents an error associated with a specific node during web crawling.
The OutputFileException class represents an unchecked exception that occurs when there is an issue with an output file or directory.
The PageLink record represents a link to a web page along with its depth in the crawling hierarchy.
The ResponseData record represents the response data received from a web request.
The ResultPage interface represents a result page obtained during web crawling.
The SuccessResultPage record represents a successful result page obtained during web crawling.
The TaskExecutor interface defines methods for executing tasks either in parallel or sequentially.
A concrete implementation of the TaskExecutor interface responsible for executing tasks.
The URIHelper class provides utility methods for handling and validating URIs.