Hello, Firehosen,
So, forewarning: This is all very very hackey and gross.
I wrote a small native-object converter[1], which uses __slots__ to
populate a dict of dicts (it looses a bit of data, but is otherwise
rad), in order to store the entire Firehose report in a MongoDB
document datastore.
I've only loaded Debian's Chromium build-logs, but I plan on doing a
full run this weekend, if all goes well.
db.results.find().count()
211
db.results.find({"results.location.file.givenpath":
"third_party/libphonenumber/src/phonenumbers/utf/unicodetext.cc"}).count()
51
db.results.find({"metadata.sut.buildarch":
"armhf"}).count()
22
Two things:
1) This is gross. I don't like how I'm converting to Pythonic objects.
Who's +1 / -1 on adding this as a top-level method like to_xml &
from_xml ?
2) I like Mongo, a lot. Would anyone mind if I supported it more? I can
work with PostgreSQL or another relational DB, but I've grown to
love MongoDB the more I use it :)
Cheers,
Paul
[1]:
https://github.com/paultag/storz/blob/master/storz/decompress.py
https://github.com/paultag/storz/blob/master/storz/store.py
--
.''`. Paul Tagliamonte <paultag(a)debian.org>
: :' : Proud Debian Developer
`. `'` 4096R / 8F04 9AD8 2C92 066C 7352 D28A 7B58 5B30 807C 2A87
`-
http://people.debian.org/~paultag