What is the best approach to copy public dynamic pages?
- by Renan
Situation: the government is supposed to publish official information online such as acts and laws.
Problem: they're using 90s expertise to do it. You can tell that by the constant use of deprecated html tags such as <table and the
lack of any compression at all, which makes some documents go way
over 700,000 bytes even though they're pure text.
Side problem: some companies are actually editing and selling this content that should be public and free.
What I need to know is the best approach to offer said official content in my own site for free.
I've thought of setting up a mirror to copy the official pages from time to time, since some of them are updated frequently, which would automatically be compressed as all my pages are via htaccess.