
pypi-mirror team mailing list archive

[Bug 381432] [NEW] HTMLParseError for package 'xm.booking'

 

Public bug reported:

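pypimirror aborts while processing the package 'xm.booking': BeautifulSoup
raises HTMLParseError ("malformed start tag") when parsing the package's
index page. A fix for this is available in my branch:
http://bazaar.launchpad.net/~srid/+junk/pypi-mirror/revision/66
(also attached as a patch).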

    2009-05-28 12:12:26,482 DEBUG  Processing package xm.booking
    Traceback (most recent call last):
      File "parts/my-pypi-mirror/z3c.pypimirror/bin/pypimirror", line 14, in <module>
        z3c.pypimirror.mirror.run()
      File "/home/sridharr/as/pypm/parts/my-pypi-mirror/z3c.pypimirror/src/z3c/pypimirror/mirror.py", line 720, in run
        config["base_url"])
      File "/home/sridharr/as/pypm/parts/my-pypi-mirror/z3c.pypimirror/src/z3c/pypimirror/mirror.py", line 430, in mirror
        links = package.ls(filename_matches, external_links, follow_external_index_pages)
      File "/home/sridharr/as/pypm/parts/my-pypi-mirror/z3c.pypimirror/src/z3c/pypimirror/mirror.py", line 290, in ls
        return [(link[0], os.path.basename(link[0]), link[1]) for link in links]
      File "/home/sridharr/as/pypm/parts/my-pypi-mirror/z3c.pypimirror/src/z3c/pypimirror/mirror.py", line 258, in _links
        for link in self._fetch_links(remote_index_html):
      File "/home/sridharr/as/pypm/parts/my-pypi-mirror/z3c.pypimirror/src/z3c/pypimirror/mirror.py", line 170, in _fetch_links
        soup = BeautifulSoup(html)
      File "build/bdist.linux-i686/egg/BeautifulSoup.py", line 1499, in __init__
      File "build/bdist.linux-i686/egg/BeautifulSoup.py", line 1230, in __init__
      File "build/bdist.linux-i686/egg/BeautifulSoup.py", line 1263, in _feed
      File "/opt/ActivePython-2.6/lib/python2.6/HTMLParser.py", line 108, in feed
        self.goahead(0)
      File "/opt/ActivePython-2.6/lib/python2.6/HTMLParser.py", line 148, in goahead
        k = self.parse_starttag(i)
      File "/opt/ActivePython-2.6/lib/python2.6/HTMLParser.py", line 226, in parse_starttag
        endpos = self.check_for_whole_start_tag(i)
      File "/opt/ActivePython-2.6/lib/python2.6/HTMLParser.py", line 301, in check_for_whole_start_tag
        self.error("malformed start tag")
      File "/opt/ActivePython-2.6/lib/python2.6/HTMLParser.py", line 115, in error
        raise HTMLParseError(message, self.getpos())
    HTMLParser.HTMLParseError: malformed start tag, at line 22, column 11
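
The change in revision 66 is not reproduced in this message. As a rough
illustration only, one possible way to keep a mirror run going past a page
like this is to catch the parser error and fall back to a loose regular
expression for the links. Everything named below (fetch_links, HREF_RE,
ParseFailure, the parse argument) is hypothetical and is not taken from
z3c.pypimirror or from the branch.

```python
import re

# Loose pattern for href values inside <a ...> tags. It is intentionally
# forgiving so it can still find links in markup a strict parser rejects.
HREF_RE = re.compile(
    r'<a\b[^>]*?\bhref\s*=\s*["\']([^"\']+)["\']', re.IGNORECASE
)


class ParseFailure(Exception):
    """Hypothetical stand-in for the parser's error type.

    In the Python 2.6 setup shown in the traceback, the exception actually
    raised is HTMLParser.HTMLParseError.
    """


def fetch_links(html, parse, parse_errors=(ParseFailure,)):
    """Return the links found in `html`.

    `parse` is any callable that takes the page source and returns a list
    of link strings (assumption: in the mirror this role is played by the
    BeautifulSoup-based code). If it raises one of `parse_errors`, fall
    back to the regex instead of aborting the whole run.
    """
    try:
        return parse(html)
    except parse_errors:
        return HREF_RE.findall(html)
```

The regex fallback can miss or mis-read links that a real parser would
handle, so this trades accuracy on broken pages for not stopping the whole
mirror run on a single package.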

** Affects: pypi-mirror
     Importance: Undecided
         Status: New

-- 
HTMLParseError for package 'xm.booking'
https://bugs.launchpad.net/bugs/381432
You received this bug notification because you are a member of PyPI
Mirror, which is the registrant for PyPI Mirror.

Status in PyPI replication project: New


