On Tue, 2005-04-05 at 21:20 +0200, Kyrre Ness Sjobak wrote:
tir, 05.04.2005 kl. 17.35 skrev seth vidal:
> > Is that worth adding yet another XML Parser package to the distribution
> > used by a single tool ? Is there a compatibility layer to still use
> > libxml2 ?
> > If I remember correctly, the performance problem wasn't libxml2 itself
> > but the specific usage within yum, i.e. collecting the data, libxml2 by
> > itself is parsing the megabyte sized file in less than a tenth of a second.
> > I'm surprized the solution ends up going to use a python specific library
> > instead of trying to find why the interface between libxml2 and yum generated
> > that problem. I don't remember you saying you would switch library as a
result.
>
> well what happened was this:
> Icon was working on repoview and decided to try out CelementTree b/c he
> was using kid anyway and it used it. After some preliminary tests it
> showed up as significantly faster parsing the metadata. For
> primary.xml.gz the times went from 21s for 1800ish pkgs to 7s. Then when
> he switched it to use iterparse() the memory footprint dropped below 10M
> for the whole parse.
>
wow. That's just... amazing!
Anyway: How large are the package in question? After all, yum is a
pretty "core" package. It's not some obscure fringe thingy. So adding
*one* package to support it can't be all that bad?
After all, didn't OOo (another non-fringe package) pretty much cause
Java to be included?
No, OOo only requires gcc-java and libgcj, nothing else. Eclipse was
the big thing that pulls in all the real java support libraries.
Dan