This project is a CLI tool designed to crawl web novels from FreeWebNovel and generate a PDF document containing the chapters. The tool uses Python libraries such as requests
, BeautifulSoup
, and pylatex
to fetch, process, and compile the novel content into a well-formatted PDF.
requests
beautifulsoup4
pylatex
argparse
pip install pageweaver
git clone https://github.com/KTS-o7/pageweaver.git
cd pageweaver
pip install -r requirements.txt
python setup.py install
pageweaver <novel_url> <start_chapter_number> <end_chapter_number> [--output_dir <output_dir>] [--num-workers <num_workers>] [--allow-non-english]
novel_url
: The FreeWebNovel URL of the novel to crawl.start_chapter
: The starting chapter number.end_chapter
: The ending chapter number.--output_dir
: (Optional) The destination directory for the generated PDF. Defaults to the current working directory.--num-workers
: (Optional) The number of workers to use for crawling. Defaults to 10.--allow-non-english
: (Optional) Allow non-English characters in the novel title and author name.pageweaver https://freewebnovel.com/global-fog-survival.html 1 15 --num-workers 5
pageweaver https://freewebnovel.com/global-fog-survival.html 1 30 --output_dir /path/to/output --allow-non-english
To crawl the novel “Global Fog Survival” from chapters 1 to 2 and generate a PDF, run:
pageweaver https://freewebnovel.com/global-fog-survival.html 1 2 --num-workers 10
This will create a PDF document in the current working directory with the title and author extracted from the novel’s metadata.
This project is licensed under the MIT License.
Contributions are welcome! Please open an issue or submit a pull request for any improvements or bug fixes.
For any questions or support, please open an issue on the GitHub repository.
This tool is not intended to promote piracy. It should be used for educational or personal reading purposes only. Please respect the copyrights of the original authors and publishers.