Jump to content

Tree stack automaton

From Wikipedia, the free encyclopedia

A tree stack automaton[a] (plural: tree stack automata) is a formalism considered in automata theory. It is a finite-state automaton with the additional ability to manipulate a tree-shaped stack. It is an automaton with storage[2] whose storage roughly resembles the configurations of a thread automaton. A restricted class of tree stack automata recognises exactly the languages generated by multiple context-free grammars[3] (or linear context-free rewriting systems).

Definition

[edit]

Tree stack

[edit]
A tree stack with stack pointer 1.2 and domain {ฮต, 1, 42, 1.2, 1.5, 1.5.3}

For a finite and non-empty set ฮ“, a tree stack over ฮ“ is a tuple (t, p) where

  • t is a partial function from strings of positive integers to the set ฮ“ โˆช {@} with prefix-closed[b] domain (called tree),
  • @ (called bottom symbol) is not in ฮ“ and appears exactly at the root of t, and
  • p is an element of the domain of t (called stack pointer).

The set of all tree stacks over ฮ“ is denoted by TS(ฮ“).

The set of predicates on TS(ฮ“), denoted by Pred(ฮ“), contains the following unary predicates:

  • true which is true for any tree stack over ฮ“,
  • bottom which is true for tree stacks whose stack pointer points to the bottom symbol, and
  • equals(ฮณ) which is true for some tree stack (t, p) if t(p) = ฮณ,

for every ฮณ โˆˆ ฮ“.

The set of instructions on TS(ฮ“), denoted by Instr(ฮ“), contains the following partial functions:

  • id: TS(ฮ“) โ†’ TS(ฮ“) which is the identity function on TS(ฮ“),
  • pushn,ฮณ: TS(ฮ“) โ†’ TS(ฮ“) which adds for a given tree stack (t,p) a pair (pn โ†ฆ ฮณ) to the tree t and sets the stack pointer to pn (i.e. it pushes ฮณ to the n-th child position) if pn is not yet in the domain of t,
  • upn: TS(ฮ“) โ†’ TS(ฮ“) which replaces the current stack pointer p by pn (i.e. it moves the stack pointer to the n-th child position) if pn is in the domain of t,
  • down: TS(ฮ“) โ†’ TS(ฮ“) which removes the last symbol from the stack pointer (i.e. it moves the stack pointer to the parent position), and
  • setฮณ: TS(ฮ“) โ†’ TS(ฮ“) which replaces the symbol currently under the stack pointer by ฮณ,

for every positive integer n and every ฮณ โˆˆ ฮ“.

Illustration of the instruction id on a tree stack
Illustration of the instruction push on a tree stack
Illustration of the instructions up and down on a tree stack
Illustration of the instruction set on a tree stack

Tree stack automata

[edit]

A tree stack automaton is a 6-tuple A = (Q, ฮ“, ฮฃ, qi, ฮด, Qf) where

  • Q, ฮ“, and ฮฃ are finite sets (whose elements are called states, stack symbols, and input symbols, respectively),
  • qi โˆˆ Q (the initial state),
  • ฮด โІfin. Q ร— (ฮฃ โˆช {ฮต}) ร— Pred(ฮ“) ร— Instr(ฮ“) ร— Q (whose elements are called transitions), and
  • Qf โІ TS(ฮ“) (whose elements are called final states).

A configuration of A is a tuple (q, c, w) where

  • q is a state (the current state),
  • c is a tree stack (the current tree stack), and
  • w is a word over ฮฃ (the remaining word to be read).

A transition ฯ„ = (q1, u, p, f, q2) is applicable to a configuration (q, c, w) if

  • q1 = q,
  • p is true on c,
  • f is defined for c, and
  • u is a prefix of w.

The transition relation of A is the binary relation โŠข on configurations of A that is the union of all the relations โŠขฯ„ for a transition ฯ„ = (q1, u, p, f, q2) where, whenever ฯ„ is applicable to (q, c, w), we have (q, c, w) โŠขฯ„ (q2, f(c), v) and v is obtained from w by removing the prefix u.

The language of A is the set of all words w for which there is some state q โˆˆ Qf and some tree stack c such that (qi, ci, w) โŠข* (q, c, ฮต) where

  • โŠข* is the reflexive transitive closure of โŠข and
  • ci = (ti, ฮต) such that ti assigns for ฮต the symbol @ and is undefined otherwise.
[edit]

Tree stack automata are equivalent to Turing machines.

A tree stack automaton is called k-restricted for some positive natural number k if, during any run of the automaton, any position of the tree stack is accessed at most k times from below.

1-restricted tree stack automata are equivalent to pushdown automata and therefore also to context-free grammars. k-restricted tree stack automata are equivalent to linear context-free rewriting systems and multiple context-free grammars of fan-out at most k (for every positive integer k).[3]

Notes

[edit]
  1. ^ not to be confused with a device with the same name introduced in 1990 by Wolfgang Golubski and Wolfram-M. Lippe [1]
  2. ^ A set of strings is prefix-closed if for every element w in the set, all prefixes of w are also in the set.

References

[edit]
  1. ^ Golubski, Wolfgang and Lippe, Wolfram-M. (1990). Tree-stack automata. Proceedings of the 15th Symposium on Mathematical Foundations of Computer Science (MFCS 1990). Lecture Notes in Computer Science, Vol. 452, pages 313โ€“321, doi:10.1007/BFb0029624.
  2. ^ Scott, Dana (1967). Some Definitional Suggestions for Automata Theory. Journal of Computer and System Sciences, Vol. 1(2), pages 187โ€“212, doi:10.1016/s0022-0000(67)80014-x.
  3. ^ a b Denkinger, Tobias (2016). An automata characterisation for multiple context-free languages. Proceedings of the 20th International Conference on Developments in Language Theory (DLT 2016). Lecture Notes in Computer Science, Vol. 9840, pages 138โ€“150, doi:10.1007/978-3-662-53132-7_12.