[mod_python] httplib questions

Sun Apr 25 12:25:15 EST 2004

Your question isn't really about mod_python; rather, it's about making the
best use of the httplib module.  

You might want to use urllib.urlopen() or urllib.urlretrieve() instead.  The
latter class will follow redirections transparently.  

If you must use httplib, you'll need to parse the response yourself, and
then make a subsequent connection to the target URL specified in the
redirection response.

Read the relevant HTTP-related RFCs (http://www.faqs.org/rfcs/) if you'd
like to understand the inner workings of the HTTP protocol.

--Michael

  _____  

From: mod_python-bounces at modpython.org
[mailto:mod_python-bounces at modpython.org] On Behalf Of Doug Gray
Sent: Sunday, April 25, 2004 11:45 AM
To: mod_python at modpython.org
Subject: [mod_python] httplib questions

Im trying to capture information from a site that I need to log into first
but Im having problems making it work.

Python 2.3.3

Im using the following example as my start

Import httplib,urllib

params = urllib.urlencode({'username': 'test', 'password': 'test')

headers = {"Content-type": "application/x-www-form-urlencoded","Accept":
"text/plain"}

conn = httplib.HTTPConnection("fantasygames.sportingnews.com")

conn.request("POST", "/crs/home_check_reg.html", params, headers)

response = conn.getresponse()

return response.status

the status that is returned is 302 which I think is something to do with
redirect.

Questions.

1.     How do I handle 302s??

2.     can sites block you from capture data in which you are a member of
that site.

3.     Is there a good forum for Python so I can do a detailed search on the
topic of capturing web source from websites.

Thanks

Doug

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.modpython.org/pipermail/mod_python/attachments/20040425/9f86a56e/attachment-0001.html