WebScrapBook is a browser extension that captures the web page faithfully with various archive formats and customizable configurations, for future retrieval, organization, annotation, and editing. This project inherits from legacy Firefox add-on ScrapBook X.
Features:
- Capture faithfully: A web page shown in the browser can be captured without losing any subtle detail. Metadata such as source URL and timestamp are also recorded.
- Customizable capture: WebScrapBook can save selected area in a page, save source page (before processed by scripts), or save page as a bookmark. How to capture images, audio, video, fonts, frames, styles, scripts, etc. are also customizable. A web page can be saved as a folder, a ZIP-based archive file (HTZ or MAFF), or a single HTML file.
- Organizable collections: Captured pages can be organized in the browser sidebar using one or more "scrapbooks". A scrapbook holds a hierarchical tree structure to organize data items, and can be further indexed for a rich-feature search (using a combination of title, fulltext keywords, custom comment, source URL, or other metadata). (*)
- Page editing: A web page can be highlighted, annotated, or edited before or after a capture. You can additionally create and manage notes using HTML or markdown format. (*)
- Remote access: Captured data can be hosted with a central backend server and be read or edited from other devices. Alternatively, a static site index can be generated for a scrapbook, which can therefore be hosted on a shared web server that doesn't support dynamic web hosting. (*)
- Mobile support: WebScrapBook supports mobile browsers such as Firefox for Android and Kiwi browser.
- Legacy ScrapBook support: Scrapbooks created from legacy ScrapBook (X) can be converted into WebScrapBook-compliant format for usage. (*)
• All or partial functionality of a starred feature above requires a running collaborating backend server, which can be easily set up using PyWebScrapBook.
Comments and Reviews
I used to use the old Scrapbook extension for Firefox which became defunct with Firefox 57. WebScrapBook is a perfect replacement for newer browsers.
(Web)ScrapBook is amongst the most useful browser extension out there. It allows you to save and, annotate and markup (highlight) webpages that you come across whilst browsing the internet. It is an excellent tool for research. And on top its free. This extension deserves to be more popular.
The unique selling point of (Web)ScrapBook is, that it completely resides within your browser. It's not connected to any service that siphons off your personal and private data.