All Classes and Interfaces
Class
Description
The BuilderValidator class provides utility methods for validating builder parameters.
The Connection record represents a connection between a parent URI and a child URI, along with
the depth of the connection.
This class contains constants used throughout the crawling process.
The DocumentParser interface defines methods for extracting relevant information from HTML
documents.
The ErrorResultPage record represents a result page containing an error encountered during web
crawling.
The ExportConfig interface defines methods for configuring data export options.
The ExportFileFormat enum represents the file formats supported for data export.
This utility class provides file-related operations.
The JsonExportConfig record represents the configuration for exporting data in JSON format.
A concrete implementation of the JWeaverCrawler.Builder interface used to configure and build
instances of JWeaverCrawler.
Represents the JWeaverCrawler abstract class, which facilitates web crawling operations.
The Builder interface provides methods for building and customizing an instance of
JWeaverCrawler.
A concrete implementation of JWeaverCrawler providing web crawling functionality.
The JWeaverDocumentParser class is responsible for parsing HTML documents to extract relevant
information.
The JWeaverExecutionException class represents an unchecked exception that occurs during the
execution of a JWeaver task.
A concrete implementation of the JWeaverWriter interface for writing data to files.
Handles the crawling process for a base URI.
The JWeaverWriter interface defines methods for processing and writing the results of the web
crawling process.
The MarkdownExportConfig record represents the configuration for exporting data in Markdown
format.
The Metadata record represents metadata associated with a web page.
The NodeError record represents an error associated with a specific node during web crawling.
The OutputFileException class represents an unchecked exception that occurs when there is an
issue with an output file or directory.
The PageLink record represents a link to a web page along with its depth in the crawling
hierarchy.
The ResponseData record represents the response data received from a web request.
The ResultPage interface represents a result page obtained during web crawling.
The SuccessResultPage record represents a successful result page obtained during web crawling.
The TaskExecutor interface defines methods for executing tasks either in parallel or
sequentially.
A concrete implementation of the TaskExecutor interface responsible for executing tasks.
The URIHelper class provides utility methods for handling and validating URIs.