Like many others, I have a process for bookmarking web pages to read later. My requirements for web page bookmarking are:
Bookmarking must be available from all platforms (within reason): PC/browser, mobile device, etc.
Bookmarks must be centrally stored (implied by the previous requirement) so that I can read them from anywhere, on any device
Full text of web pages must be stored
Bonus features would be:
Bookmarks and page content should be full text searchable
Maintain an archive indefinitely
Distinguish between what's read vs. unread
Bookmarked page content is cleaned up, e.g. ads eliminated, unnecessary HTML removed, pages better formatted for reading
My current process (which addresses most of these requirements) is as follows:
I set up a Gmail account with 2 labels, "Bookmarks Unread" and "Bookmarks Read"
I set up Gmail filters so that, depending on the form of the address (using Gmail's '+string' functionality in addresses), the incoming bookmark gets labeled appropriately
On each of my browsers/devices, I have an address book entry for [email protected] and [email protected].
If I want to clean up the page content, I use the Readability bookmarklet, which does a great job of keeping only the essential content
Anywhere I have Firefox, I use the Send Page by Email extension which, with 2 clicks, allows me to send the cleaned-up Readability page URL and content to one of the above email addresses.
Where I don't have Firefox (e.g. iPhone or other mobile device), I use the native ability to send the current link via email (most/all apps have it, including the browser, RSS readers, NYTimes, etc.). In most cases (unless it's built into the particular app), this won't include the page body.
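To make the '+string' routing above concrete, here is a minimal sketch of the decision the Gmail filters make. The tag names "unread" and "read" and the example.com addresses are placeholders I made up for illustration, not my real addresses, and Gmail itself does this through filters on the "To" address rather than through code.

```python
# Hypothetical mapping from the '+string' part of an address to a label.
# The tag names are placeholders, not the real ones.
TAG_TO_LABEL = {
    "unread": "Bookmarks Unread",
    "read": "Bookmarks Read",
}


def plus_tag(address):
    """Return the text between '+' and '@' in an address, or None if absent."""
    local_part, _, _domain = address.rpartition("@")
    _base, plus, tag = local_part.partition("+")
    return tag if plus else None


def label_for(address):
    """Pick the label a filter on the 'To' address would apply, if any."""
    tag = plus_tag(address)
    if tag is None:
        return None
    return TAG_TO_LABEL.get(tag.lower())
```

For example, label_for("someuser+unread@example.com") picks "Bookmarks Unread", while an address with no '+string' part gets no label.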
The process is almost perfect. With Gmail as the storage mechanism I get central, ubiquitous access, full text searchability (thanks to Gmail, but of course only for the pages I send from that Firefox extension), a cleaned-up page thanks to Readability, the ability to read offline (assuming I use an IMAP client against Gmail) and permanent archiving of content, including what's been read vs. unread.
The missing pieces are:
The Send Page by Email Firefox extension seems to send only the first X bytes (or some portion) of a web page, which limits my full text searchability.
Where I don't have Firefox, I can only send the link, so no full text search at all in those cases.
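One possible direction for both gaps, sketched here under assumptions rather than something I have built: a small script that composes the bookmark email itself, so the whole page text goes into the body. The function name build_bookmark_email and the example.com addresses are hypothetical, and fetching and cleaning the page is assumed to happen elsewhere.

```python
from email.message import EmailMessage

# Placeholder addresses; the real ones follow the same '+string' pattern.
UNREAD_ADDRESS = "someuser+unread@example.com"
READ_ADDRESS = "someuser+read@example.com"


def build_bookmark_email(url, title, page_text, already_read=False):
    """Build (but do not send) an email carrying the full text of a page."""
    msg = EmailMessage()
    msg["To"] = READ_ADDRESS if already_read else UNREAD_ADDRESS
    msg["Subject"] = title
    # URL first, then the complete page text, so nothing is cut off.
    msg.set_content(url + "\n\n" + page_text)
    return msg
```

Actually sending the message would still need an SMTP client (Python's smtplib, for instance), which is left out here.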
Instapaper looks like it meets most of my requirements (and bonus items). The only downside for me (personal preference) is that central storage lives with Instapaper rather than with something broader like Gmail, which, as a general-purpose service with Google behind it, is about as permanent as it gets. I'm not too hung up on this, but I would definitely prefer to keep Gmail if possible. An upside of Instapaper is that it does the page clean-up and also stores the entire page content, unlike my Firefox extension.
Thoughts on addressing the gaps and improving this process further?