|
LogAnalysis
[logs] Error messages from syslogd Jul 11 2007 06:38AM Tina Bird (tbird precision-guesswork com) (2 replies) Re: [logs] Error messages from syslogd Jul 11 2007 03:47PM Stephen John Smoogen (smooge gmail com) (1 replies) RE: [logs] Error messages from syslogd Jul 11 2007 04:41PM Tina Bird (tbird precision-guesswork com) (3 replies) RE: [logs] Error messages from syslogd Jul 11 2007 09:02PM Rainer Gerhards (rgerhards hq adiscon com) RE: [logs] Error messages from syslogd Jul 11 2007 07:13PM Daniel Cid (danielcid yahoo com br) (2 replies) Re: [logs] Error messages from syslogd Jul 11 2007 07:42PM Justin Mitchell (jmitchell secureworks com) (2 replies) Re: [logs] Error messages from syslogd Jul 11 2007 08:00PM Marcus J. Ranum (mjr ranum com) (1 replies) Re: [logs] Error messages from syslogd Jul 11 2007 09:55PM Russell Fulton (r fulton auckland ac nz) (2 replies) Re: [logs] Error messages from syslogd Jul 12 2007 12:32AM Marcus J. Ranum (mjr ranum com) (1 replies) RE: [logs] Error messages from syslogd Jul 11 2007 07:50PM Tina Bird (tbird precision-guesswork com) (1 replies) RE: [logs] Error messages from syslogd Jul 11 2007 07:16PM Tina Bird (tbird precision-guesswork com) Re: [logs] Error messages from syslogd Jul 11 2007 08:00AM Rainer Gerhards (rgerhards hq adiscon com) |
|
Privacy Statement |
In fact the parsing tree you've mentioned and the regular expressions
can be used together - OSSEC, for example, implements hierarchical
rules that are based on regular expressions. So I guess the
difficulties you mentioned are not RE-related but rather list vs tree
parsing methods related.
http://www.ossec.net/ossec-docs/auscert-2007-dcid.pdf
(see page 32)
Kind regards,
Amiran Alavidze
On 7/12/07, Marcus J. Ranum <mjr (at) ranum (dot) com [email concealed]> wrote:
> Briefly, here's what needs to be done:
> Instead of trying to build sequences of fall-through matching rules, you
> need to build an acyclic left-to-right parse tree that completely defines
> all the message forms. Abe and I did some experimenting with this
> a few years ago and it turned out there are only about 50,000 variant
> forms of messages - well - 60,000 if you count all the new ones the
> OpenBSD guys appear to have added to syslogd. Obviously, you
> don't need them all at once; you just need the ones you're seeing.
> What's funny is that executing against a parse tree is going to be
> oodles faster than a list of regular expressions once you go above
> a certain number.
>
> I offer as evidence an evil hack I did for a buddy last year, which
> simply builds a nested sequence of calls to sscanf, keeping
> track of the farthest-right point of matching, and walking left
> to right. I forget the exact number but it was handling something
> like 400,000 log lines per second at 2% CPU utilization (I.e.: it
> was I/O bound) - I am not necessarily suggesting that someone
> build a tool that outputs a C-coded recognizer... but consider
> for a second that if you did, your iPod could handle log data
> rates that make the largest commercial SIMs eat their own
> intestines in sheer agony.
_______________________________________________
LogAnalysis mailing list
LogAnalysis (at) loganalysis (dot) org [email concealed]
http://www.loganalysis.org/mailman/listinfo/loganalysis
[ reply ]