Discussion about this post

Neural Foundry

The hybrid LLM/regex pattern is particularly clever: sampling with LLMs to discover entities, then using traditional string matching for scale. Your point about modeling the likelihood of finding new terms through continued sampling is brilliant; it essentially quantifies confidence in the completeness of your extraction. The congressional hearing matcher is a great example of using business logic as a filter before LLM adjudication.

Andrew Gard

Awesome! Very helpful.
