The formalization and implementation of Adaptable Parsing Expression Grammars.

Resumo

The term “extensible language” is especially used when a language allows the extension of its own concrete syntax and the definition of the semantics of new constructs. Most popular tools designed for automatic generation of syntactic analysers do not offer any adequate resources for the specification of extensible languages. When used in the implementation of features like syntax macro definitions, these tools usually impose severe restrictions. For example, it may be required that macro definitions and their use reside indifferent files; or it may be impossible to perform the syntax analysis in one single pass. We claim that one of the main reasons for these limitations is the lack of appropriate formal models for the definition of the syntax of extensible languages. This paper presents the design and formal definition of Adaptable Parsing Expression Grammars, an extension to the Parsing Expression Grammar (PEG) model that allows the manipulation of its own production rules during the analysis of an input string. The proposed model compares favourably with similar approaches for the definition of the syntax of extensible languages. An implementation of the model is also presented, simulating the behavior of packrat parsers. Among the challenges for this implementation is the use of attributes and on the fly modifications on the production rules at parse time, features not present in standard PEG. This approach has been used on the definition of a real extensible language, and initial performance tests suggest that the model may work well in practice.

Descrição

Palavras-chave

Extensible languages, Adaptable grammars

Citação

REIS, L. V. dos S. et al. The formalization and implementation of Adaptable Parsing Expression Grammars. Science of Computer Programming, v. 96, p. 191-210, 2014. Disponível em: <http://www.sciencedirect.com/science/article/pii/S0167642314000872>. Acesso em: 09 jan. 2015.

Avaliação

Revisão

Suplementado Por

Referenciado Por