FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text. FASTUS is a system for extracting information from natural language text for entry into a database and for other applications. It works essentially as a cascaded, nondeterministic finite-state automaton. There are five stages in the operation of FASTUS. In Stage 1, names and other fixed form expressions are recognized. In Stage 2, basic noun groups, verb groups, and prepositions and some other particles are recognized. In Stage 3, certain complex noun groups and verb groups are constructed. Patterns for events of interest are identified in Stage 4 and corresponding “event structures” are built. In Stage 5, distinct event structures that describe the same event are identified and merged, and these are used in generating database entries. This decomposition of language processing enables the system to do exactly the right amount of domain-independent syntax, so that domain-dependent semantic and pragmatic processing can be applied to the right larger-scale structures. FASTUS is very efficient and effective, and has been used successfully in a number of applications.
Keywords for this software
References in zbMATH (referenced in 3 articles )
Showing results 1 to 3 of 3.
- Pajić, Vesna; Lažetić, Gordana Pavlović; Pajić, Miloš: Information extraction from semi-structured resources: a two-phase finite state transducers approach (2011)
- Friburger, N.; Maurel, D.: Finite-state transducer cascades to extract named entities in texts. (2004)
- Hobbs, Jerry R.; Appelt, Douglas E.; Bear, John; Israel, David J.; Kameyama, Megumi; Stickel, Mark E.; Tyson, Mabry: FASTUS: A cascaded finite-state transducer for extracting information from natural-language text (1997) ioport