TL;DR —
In this article, I compare two parsing methods with a focus on their performance. The first method uses regular expressions for parsing and data extraction. The second method uses a Non-deterministic Finite Automata (NDFA) to parse text. The NDFA is a class of FSMs that can be used for any task that has distinct states of processing or operation. In practice, the algorithm has to follow every possible transition from a state to another and when a possible path fails it goes back and continues with another path.
[story continues]
Written by
@spidim
Senior Software Architect / Engineering Lead at Behavioral Signals
Topics and
tags
tags
finite-automatons|regular-expressions|java|parser|parsing|programming|text-parsing|hackernoon-top-story
This story on HackerNoon has a decentralized backup on Sia.
Transaction ID: 85K29CfXLGp3R4wEw3gD097_cGEz2JT5Ueyednudl_0
