related work & correlation of problem data
Paul Tagliamonte
paultag at debian.org
Tue Oct 1 14:26:56 UTC 2013
On Tue, Oct 01, 2013 at 03:48:47PM +0200, Martin Milata wrote:
> Hello firehose-devel,
Hey there Martin,
> I have recently stumbled upon the firehose format and surrounding
> ecosystem and realized that it somewhat overlaps with what we've been
> working on within the Automatic Bug Reporting Tool project [1]. I'll
> take the liberty of briefly introducing our format and software:
>
> * uReport is a JSON-based format for describing run-time software
> problems. It aims to contain no anonymous data and be useful for
> automatic processing. There's no formal specification but some
> information can be found at [2].
>
> * FAF is the main consumer of uReports. It performs some analysis on
> them (e.g. resolving addresses to source code locations, or grouping
> reports likely caused by the same bug), and provides a web interface
> with statistics. The instance for Fedora is available at [3]
>
> * satyr is a library (in C with Python bindings) for creating and
> manipulating uReports. It is used by ABRT to create uReports and by
> FAF to perform clustering on them. In terms of type of problems, it
> currently supports crashes of native binaries that produce core dump,
> uncaught python exceptions, and kernel oopses.
ACK. Is patching whoopsie-daisy to output this format on the roadmap?
I'd love to continue to foster collaboration from Fedora / RH -> Debian / Ubuntu
(doubly so if this is going to integrate with Firehose!)
> Due to the slightly different goals of both projects and substantial
> amount of code already written, it seems unclear whether it would be
> beneficial to directly collaborate. However, it might be interesting to
> find out if we can obtain anything useful by correlating the reports
> from static analyzers and from crashes on users' computers.
It would indeed. I plan on producing Firehose for the Debian archive
once I have the cycles to finish the management code.
> To be more concrete, we were thinking of doing something like this:
>
> * Run static analysis on a particular package, keep only the issues
> that would correspond to a crash if it occurred.
Hummmmm. This might be a toughie. Figuring out what may cause a crash
turns out to be hard(tm), but keeping *all* the data would be a snap.
> * Take the crash reports for that package.
>
> * For every static analysis result, try to find a crash report with the
> same source code location.
Interesting. This seems like it could be quite nice, if we keep debug
headers and have the lineno / bt
> Finding such pair would mean that the static analysis issue is not a
> false positive and also the additional information from the static
> analysis (e.g. trace) could be added to the crash report.
>
> Does it make any sense? Would the results of such analysis be useful?
I'm extremely intrigued.
> Best regards,
> Martin Milata
>
> [1] https://github.com/abrt/abrt/wiki/ABRT-Project
> [2] https://github.com/abrt/faf/wiki/uReport
> [3] https://retrace.fedoraproject.org/faf/summary/
> _______________________________________________
> firehose-devel mailing list
> firehose-devel at lists.fedoraproject.org
> https://admin.fedoraproject.org/mailman/listinfo/firehose-devel
Cheers,
Paul
--
.''`. Paul Tagliamonte <paultag at debian.org>
: :' : Proud Debian Developer
`. `'` 4096R / 8F04 9AD8 2C92 066C 7352 D28A 7B58 5B30 807C 2A87
`- http://people.debian.org/~paultag
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 836 bytes
Desc: Digital signature
URL: <http://lists.fedoraproject.org/pipermail/firehose-devel/attachments/20131001/98bd11d5/attachment.sig>
More information about the firehose-devel
mailing list