escaping query string with special characters with python

Posted by that_guy on Stack Overflow See other posts from Stack Overflow or by that_guy
Published on 2012-06-17T02:49:50Z Indexed on 2012/06/17 3:16 UTC
Read the original article Hit count: 178

Filed under:
|
|
|
|

I got some pretty messy urls that i got via scraping here, problem is that they contain spaces or other special characters in the path and query string, here is some example

http://www.example.com/some path/to the/file.html
http://www.example.com/some path/?file=path to/file name.png&name=name.me

so, is there an easy and robust way to escape the urls so that i can pass them to urlopen? i tried urlib.quote, but it seems to escape the '?', '&', and '=' in the query string as well, and it seems to escape the protocol as well, currently, what i am trying to do is use regex to separate the protocol, path name, and query string and escape them separately, but there are cases where they arent separated properly any advice is appreciated

© Stack Overflow or respective owner

Related posts about python

Related posts about url