#observability
2020-08-07
sparkofreason17:08:03

I'm about to write a bunch of log-line parsers for logs across the variety of languages/platforms we use. Any pointers to existing libraries that might handle this task?

lukasz18:08:04

instaparse or clj-antlr if you need the big guns; good old regexes are always there
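
For the regex option mentioned above, a minimal sketch in Python might look like the following. The log format, the field names, and the `parse_line` helper are all assumptions made up for illustration; nothing in the thread specifies the actual log layouts involved.

```python
import re

# Hypothetical log format, assumed for illustration only:
#   2020-08-07T17:08:03Z INFO [worker-1] message text
LINE_RE = re.compile(
    r"(?P<timestamp>\S+)\s+"       # first whitespace-delimited token
    r"(?P<level>[A-Z]+)\s+"        # upper-case severity, e.g. INFO
    r"\[(?P<thread>[^\]]+)\]\s+"   # bracketed thread or component name
    r"(?P<message>.*)"             # remainder of the line
)


def parse_line(line):
    """Return a dict of named fields, or None if the line does not match."""
    match = LINE_RE.fullmatch(line.strip())
    return match.groupdict() if match else None


sample = "2020-08-07T17:08:03Z INFO [worker-1] started job id=42"
parsed = parse_line(sample)
```

Named groups keep the result self-describing, and returning `None` on a mismatch lets the caller decide what to do with unparseable lines. Nested or multi-line formats are where a single pattern like this tends to become hard to maintain, which is the usual reason to reach for a grammar-based tool instead.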

lukasz18:08:42

clj-antlr might be the most robust, as you might find compatible grammar definitions around the internet

lukasz18:08:58

(that's how we kick-started our Lucene-like query interface in our product)

sparkofreason19:08:49

Started with regex, too limiting, have moved on to instaparse. Thought about clj-antlr, but didn't have a lot of hope for finding open-sourced grammars. I guess I should actually look...

jumar20:08:23

If you care about speed, ANTLR is much faster than instaparse

sparkofreason20:08:43

Definitely. Instaparse is nice for REPL-driven development, but once I get the grammars nailed down I'll probably move to ANTLR.

sogaiu21:08:49

i noticed that https://github.com/antlr/grammars-v4 had a variety of grammars -- is it possible one of them might already be close to what you might need?

sparkofreason12:08:11

I did look in there, didn't find anything. Anyway, turning out to be pretty easy to define log parsers in instaparse.