Like many others, I have a process for bookmarking web pages to read later. My requirements for web page bookmarking are:
Ability to bookmark pages must be available from all (within reason) platforms - PC/browser, mobile device, etc.
Bookmarks must be centrally stored (implicit from #2) so that I can read the bookmarks from anywhere/any device
Full text of web pages must be stored
Bonus features would be:
Bookmarks and page content should be full text searchable
Maintain an archive indefinitely
Distinguish between what's read vs. unread
Bookmarked page content is cleaned up, e.g. ads eliminated, unnecessary html removed, pages better formatted for reading
My current process (which addresses most of these requirements) is as follows:
I set up a Gmail account with 2 labels, "Bookmarks Unread" and "Bookmarks Read"
Gmail filters are set up so that, depending on the form of the address (using Gmail's '+string' functionality in addresses), the incoming bookmark gets labeled appropriately
On each of my browsers/devices, I have an address book entry for [email protected] and [email protected].
If I want to clean up the page content, I use the Readability bookmarklet, which does a great job of giving me the essential content only
Anywhere I have Firefox, I use the Send Page by Email extension which, with 2 clicks, allows me to send the cleaned-up Readability page URL and content to one of the above email addresses.
Where I don't have Firefox (e.g. iPhone or other mobile device), I use the native ability to send the current link via email (most/all apps have it, including the browser, RSS readers, NYTimes, etc.). In most cases (unless it's built into the particular app), this won't include the page body.
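To illustrate the '+string' routing the filters rely on, here is a minimal Python sketch of how the tag in an address could map to a label. The tags `unread` and `read`, the `example.com` addresses, and the function names are hypothetical; the real addresses are not shown above, and Gmail applies its filters server-side rather than through code like this.

```python
from email.utils import parseaddr

# Hypothetical tag-to-label mapping; the real tags are not given in the post.
LABELS = {
    "unread": "Bookmarks Unread",
    "read": "Bookmarks Read",
}

def plus_tag(address):
    """Return the '+tag' part of an address, or None if there is none."""
    _, addr = parseaddr(address)          # strips any display name
    local, _, _ = addr.partition("@")     # text before the @
    _, sep, tag = local.partition("+")    # text after the first +
    return tag if sep else None

def label_for(address):
    """Return the label a filter would apply, or None if no tag matches."""
    return LABELS.get(plus_tag(address))
```

With this mapping, `label_for("me+unread@example.com")` gives "Bookmarks Unread", and an address with no '+string' gets no label.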
The process is almost perfect. I've got the central, ubiquitous access of Gmail as the storage mechanism, full text searchability (due to Gmail, but of course only for the pages I send from that Firefox extension), a cleaned-up page due to Readability, the ability to read offline (assuming I use an IMAP client against Gmail), and permanent archiving of content, including what's been read vs. unread.
The missing pieces are:
The Send Page by Email Firefox extension seems to send only part of a web page (some fixed number of bytes, or some portion), which limits my full text searchability.
Where I don't have Firefox, I can only send the link, so no full text search at all in those cases.
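One possible way to think about closing both gaps is a small script that turns a page's HTML into plain text and wraps the whole thing in an email, so nothing gets truncated. The sketch below is only an illustration under assumptions: the function names are made up, the text extraction is far cruder than Readability's clean-up, and fetching the page and actually sending the message (for example over SMTP) are left out.

```python
from email.message import EmailMessage
from html.parser import HTMLParser

class _TextExtractor(HTMLParser):
    """Collect text nodes, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.parts.append(data.strip())

def html_to_text(html):
    """Return the text of an HTML document, one text node per line."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.parts)

def build_bookmark_message(url, title, html, to_addr):
    """Build (but do not send) an email holding the URL and full page text."""
    msg = EmailMessage()
    msg["To"] = to_addr          # e.g. one of the '+string' addresses
    msg["Subject"] = title
    msg.set_content(url + "\n\n" + html_to_text(html))
    return msg
```

Because the message body carries the complete extracted text, Gmail's search would cover the whole page rather than only the portion the extension sends.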
Instapaper looks like it meets most of my requirements (and bonus items).
The only downside to me (personal preference) is that central storage is based on Instapaper rather than something broader like Gmail, which, as a generalized service with Google behind it, is likely to be around permanently. I'm not too hung up on this, but I would definitely prefer to keep Gmail if possible. An upside of Instapaper is that it does the page clean-up and also stores the entire page content, unlike my Firefox extension.
Thoughts on addressing the gaps and improving this process further?