A mathematical model describing how a machine processes an input of symbols and halts to produce a boolean output of accept or reject.
This subject will discuss the following models of computation
A string consisting of symbols that is input to some model of computation. Words are conventionally denoted as \(w\)
A word with no symbols, represented by \( \lambda\) or \( \varepsilon\)
Set of all words accepted by some model of computation \(M\). It is often denoted as \(L\) or \(L(M)\) to denote the language of specific machine \(M\). Note that languages are subsets of the set of all words
\(w \in L(M) \iff M \text{ accepts } w\)
Property of a model of computation such that the computation process has a single predetermined sequence of steps to obtain an output
Nondeterminism refers to the property where there are multiple sequences of computation, and the machine accepts iff there exists some sequence that accepts
A computing model that treats an input string as instructions: the leftmost symbol determines whether and to which state the program transitions, and is then popped off the string. When the string is exhausted (becomes the empty string, AKA \(\lambda\)), the input is accepted if the program's state is one of the designated states; otherwise it is rejected.
DFAs are defined by a 5-tuple \( M = ( Q , \Sigma , \delta , q_{0} , F) \) where:
We can represent DFAs as graphs such that \(M=G(Q,E)\), where each edge between states is labelled with a symbol \(\sigma \in \Sigma\)
Merging two DFAs together logically under some operation using the formal definition, where the new DFA's states are ordered pairs (or tuples) of states from the amalgamated DFAs
State diagrams often give easy-to-understand visual references for the functionality of a DFA
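As a concrete illustration of the 5-tuple and the product construction above, here is a minimal Python sketch; the class name, the `accepts` method, and the even-number-of-1s example language are our own choices, not from these notes.

```python
# A minimal sketch of the 5-tuple (Q, Sigma, delta, q0, F) as a Python class.
class DFA:
    def __init__(self, states, alphabet, delta, start, accepting):
        self.states, self.alphabet = states, alphabet
        self.delta = delta          # dict: (state, symbol) -> state
        self.start, self.accepting = start, accepting

    def accepts(self, word):
        q = self.start
        for sigma in word:          # consume the leftmost symbol each step
            q = self.delta[(q, sigma)]
        return q in self.accepting  # accept iff we halt in a state of F

# Product construction: states are ordered pairs, combined here under union.
def product_union(m1, m2):
    # Assumes m1 and m2 share the same alphabet.
    states = {(p, q) for p in m1.states for q in m2.states}
    delta = {((p, q), a): (m1.delta[(p, a)], m2.delta[(q, a)])
             for (p, q) in states for a in m1.alphabet}
    accepting = {(p, q) for (p, q) in states
                 if p in m1.accepting or q in m2.accepting}
    return DFA(states, m1.alphabet, delta, (m1.start, m2.start), accepting)

# Example: words over {0,1} with an even number of 1s.
even_ones = DFA({'e', 'o'}, {'0', '1'},
                {('e', '0'): 'e', ('e', '1'): 'o',
                 ('o', '0'): 'o', ('o', '1'): 'e'},
                'e', {'e'})
assert even_ones.accepts('1010') and not even_ones.accepts('10')
```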
A spin on the DFA. The DFA is deterministic, meaning that \(\delta\) determines a single state for the program to enter as an output. NFAs have the phenomenon that \(\delta\) offers a set of legal states that can be entered, which can be chosen from. A word is accepted so long as there exists at least one path to a \(q \in F\); an NFA rejects a string if it cannot find such a path. NFAs let you designate a path for \(\lambda\), which changes state without popping anything off the string. They are also lenient in that a state doesn't need a path for every element of the alphabet
NFAs are defined by a 5-tuple \( M = ( Q , \Sigma , \delta , Q_{0} , F) \) where:
NFAs may allow for multiple initial states. In the case that a formalism doesn't, you can easily emulate the idea of multiple start states by having one start state lead into the states that you originally wanted to make the start states
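A minimal sketch of NFA acceptance in Python, tracking the set of currently reachable states (the "exists a path" condition), with \(\lambda\)-moves encoded as the empty-string symbol `''`; all names and the example language (words ending in `ab`) are illustrative assumptions.

```python
def epsilon_closure(delta, states):
    """All states reachable via lambda-transitions (no symbol consumed)."""
    frontier, seen = list(states), set(states)
    while frontier:
        q = frontier.pop()
        for r in delta.get((q, ''), set()) - seen:
            seen.add(r)
            frontier.append(r)
    return seen

def nfa_accepts(delta, starts, accepting, word):
    current = epsilon_closure(delta, starts)   # multiple start states allowed
    for sigma in word:
        moved = set().union(*[delta.get((q, sigma), set()) for q in current])
        current = epsilon_closure(delta, moved)
    return bool(current & accepting)           # exists a path to some q in F

# Example: words over {a,b} ending in 'ab'.
delta = {('0', 'a'): {'0', '1'}, ('0', 'b'): {'0'}, ('1', 'b'): {'2'}}
assert nfa_accepts(delta, {'0'}, {'2'}, 'bab')
assert not nfa_accepts(delta, {'0'}, {'2'}, 'ba')
```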
A regular expression that has the exact language of the NFA in question
Theorem asserting that some DFA recognizes a language iff some NFA recognizes that language
\(\exists \text{DFA} : L(\text{DFA}) = L' \iff \exists \text{NFA} : L(\text{NFA}) = L'\)
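One direction of this theorem is the subset construction, sketched roughly below: each DFA state is a frozenset of NFA states, and the result recognizes the same language. The function name and the grammar of the transition dictionary follow the NFA sketch above and are our own assumptions.

```python
def nfa_to_dfa(delta, starts, accepting, alphabet):
    def closure(states):   # lambda-closure, as in the NFA sketch above
        frontier, seen = list(states), set(states)
        while frontier:
            q = frontier.pop()
            for r in delta.get((q, ''), set()) - seen:
                seen.add(r)
                frontier.append(r)
        return seen

    start = frozenset(closure(starts))
    dfa_delta, frontier, dfa_states = {}, [start], {start}
    while frontier:                      # explore only reachable subsets
        S = frontier.pop()
        for a in alphabet:
            moved = set().union(*[delta.get((q, a), set()) for q in S])
            T = frozenset(closure(moved))
            dfa_delta[(S, a)] = T
            if T not in dfa_states:
                dfa_states.add(T)
                frontier.append(T)
    dfa_accepting = {S for S in dfa_states if S & set(accepting)}
    return dfa_states, dfa_delta, start, dfa_accepting

# Reusing the 'ends in ab' NFA: the resulting DFA accepts the same words.
delta = {('0', 'a'): {'0', '1'}, ('0', 'b'): {'0'}, ('1', 'b'): {'2'}}
states, d, q, acc = nfa_to_dfa(delta, {'0'}, {'2'}, {'a', 'b'})
for ch in 'bab':
    q = d[(q, ch)]
assert q in acc
```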
Language recognizable by some finite automaton
\(L(M)= A \implies \exists N : L(N) = \overline{A}\)
\(L(M)= A \land L(N)= B \implies \exists V : L(V) = A \cup B\)
\(L(M)= A \land L(N)= B \implies \exists V : L(V) = A \cap B\)
\(L(M)= A \implies \exists N : L(N) = A^{R}\)
Expression used to describe regular languages in a more concise and convenient way than finite automata
Regular expressions are defined with some base constants that can be inductively extended through operations
Where \(R_{1},R_{2}\) are regular expressions and \(L_{1},L_{2}\) are languages:
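For a quick hands-on check, Python's `re` module implements (a superset of) these operations; the example language here, binary strings ending in 0, is our own.

```python
import re

# Union (|), concatenation, and Kleene star (*) in one pattern.
pattern = re.compile(r'^(0|1)*0$')
assert pattern.match('10') and pattern.match('0')
assert not pattern.match('01') and not pattern.match('')
```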
The idea that going to the gym gives you 'the pump'
Lemma claiming that for any regular language \(L\) there is a pumping length \(P \in \mathbb{N}\) (any \(P \geq |Q|\) works) such that every word \(w \in L\) with \(|w| \geq P\) can be decomposed as \(w = xyz\) with \(|y| > 0\), \(|xy| \leq P\), and \(xy^{k}z \in L\) for all \(k \geq 0\)
So each \(w \in L\) with \(|w| \geq P\) has the structure of padding \(x,z\) on both sides of the string, while in between this padding the segment \(y\) can be 'pumped' any number of times. If there are \(P\) states, then any string longer than \(P\) must visit some state multiple times (this might remind you a bit of the pigeonhole principle) and can hence loop around \(k\) times before proceeding to the end padding. This lets us pose the conditions that a loop exists (\( |y| \gt 0\)), and that the path from the start to the end of the first iteration of the loop is at most the number of states long (\( |xy| \leq P \)). If we have a loop, we are able to invoke it as many times as we like, so if the language is regular then \(\forall k \geq 0, xy^{k}z \in L\)
Note that the absolute-value bars give the string's length
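As a standard worked example of using the lemma contrapositively (the textbook instance, not from these notes): \(L = \{0^{n}1^{n} : n \geq 0\}\) is not regular. Take \(w = 0^{P}1^{P} \in L\); since \(|xy| \leq P\), the loop \(y\) consists only of \(0\)s, say \(y = 0^{j}\) with \(j > 0\). Pumping once gives \(xy^{2}z = 0^{P+j}1^{P} \notin L\), contradicting the lemma, so \(L\) cannot be regular.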
We say that string \(z\) distinguishes strings \(x\) and \(y\) relative to a language \(L\) when \((xz \in L \land yz \notin L) \lor (yz \in L \land xz \notin L)\) (concatenating \(z\) onto these strings puts one of them in the language and the other outside it).
When no \(z\) can do this for \(x\) and \(y\), we call them indistinguishable and write \(x \equiv_{L} y\). Logically this means \(\forall z, (xz \in L \iff yz \in L)\); importantly, \(\equiv_{L}\) is an equivalence relation, and its classes are called equivalence classes
For reflexivity, it means \(xz \in L \iff xz \in L\), which is clearly true. For symmetry, \(x \equiv_{L} y\) gives \(yz \in L \iff xz \in L\), which is the same biconditional read backwards. For transitivity, \(x \equiv_{L} y \land y \equiv_{L} v \implies x \equiv_{L} v\): chaining the two biconditionals gives \(xz \in L \iff yz \in L \iff vz \in L\) for every \(z\). Hence \(\equiv_{L}\) is an equivalence relation
If there are a finite number of equivalence classes for \(\equiv_{L}\), then \(L\) is regular
This is because \(\neg (x \equiv_{L} y)\) means that \(x\) and \(y\) are strings leading to different states, and if you have infinitely many pairwise distinguishable strings, you need infinitely many states, and it's called a finite state machine for a reason, right?
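A standard worked instance (not from these notes): for \(L = \{0^{n}1^{n} : n \geq 0\}\), any two strings \(0^{i}\) and \(0^{j}\) with \(i \neq j\) are distinguished by \(z = 1^{i}\), since \(0^{i}1^{i} \in L\) but \(0^{j}1^{i} \notin L\). This gives infinitely many equivalence classes, so \(L\) is not regular, agreeing with the pumping-lemma argument above.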
Data buffer where data is retrieved by pushing (loading data object on the top of the stack) and popping (removing and returning data object on the top of the stack). This access method is known as LIFO (Last In First Out)
A finite automaton that has access to a stack
PDAs are defined by a 6-tuple \( M = ( Q , \Sigma , \Gamma , \delta , q_{0} , F) \) where:
To represent the base of the stack, we use \(\$\)
We can represent PDAs as graphs just like DFAs and NFAs, such that \(M=G(Q,E)\), where each edge is labelled \(\sigma, \gamma_{\text{pull}} \rightarrow \gamma_{\text{push}}\)
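A minimal sketch of a PDA run in Python for the classic language \(\{a^{n}b^{n} : n \geq 1\}\); the language, function name, and the 'A' stack symbol are our own assumptions, with '$' marking the stack base as above.

```python
def anbn_pda_accepts(word):
    stack = ['$']          # '$' marks the base of the stack
    i = 0
    # Phase 1: push an 'A' for every leading 'a'   (edge: a, push A)
    while i < len(word) and word[i] == 'a':
        stack.append('A')
        i += 1
    # Phase 2: pop an 'A' for every 'b'            (edge: b, A -> pop)
    while i < len(word) and word[i] == 'b' and stack[-1] == 'A':
        stack.pop()
        i += 1
    # Accept iff the input is exhausted and only the base marker remains.
    return i == len(word) and stack == ['$'] and len(word) > 0

assert anbn_pda_accepts('aabb') and not anbn_pda_accepts('aab')
```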
Language recognisable by some PDA
CFGs are a 'grammar' (a set of lexical rules) analogous to regular expressions for PDAs, in the sense that you can represent any context-free language with a CFG just as a regular expression can express any regular language
CFGs are defined by a 4-tuple \( M = ( V , \Sigma , R , S) \) where:
If we have two \(s,t \in (V \cup \Sigma)^{*}\) such that rules can transform \(s\) into \(t\), either directly or implicitly (by one or many rules) through intermediate strings \(u_{n} \in (V \cup \Sigma)^{*}\), we say that \(s\) derives \(t\); in mathematical notation, \(s \Rightarrow^{*} t\)
It can be shown that a PDA is logically equivalent to some CFG. To convert from a CFG to a PDA you can simply use the stack transitions as a method to emulate the idea of applying rules, as the sketch below shows
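A rough sketch of that idea in Python: the stack holds the current sentential form, the top variable is nondeterministically replaced by a rule's right-hand side, and a top terminal is matched against the input and popped. The grammar encoding (uppercase letters are variables, `''` plays \(\lambda\)) and the stack-size bound are our own assumptions.

```python
from collections import deque

def cfg_accepts(rules, start, word):
    # A configuration is (remaining input, stack), top of stack leftmost.
    frontier, seen = deque([(word, start)]), {(word, start)}
    while frontier:
        rest, stack = frontier.popleft()
        if not rest and not stack:
            return True                          # input and stack exhausted
        nxt = []
        if stack and stack[0].isupper():         # top is a variable: expand
            nxt = [(rest, rhs + stack[1:]) for rhs in rules[stack[0]]]
        elif rest and stack and rest[0] == stack[0]:
            nxt = [(rest[1:], stack[1:])]        # top is a terminal: match/pop
        for config in nxt:
            # Bound the stack so the nondeterministic search terminates.
            if config not in seen and len(config[1]) <= 2 * len(word) + 2:
                seen.add(config)
                frontier.append(config)
    return False

rules = {'S': ['aSb', '']}                       # S -> aSb | lambda
assert cfg_accepts(rules, 'S', 'aabb') and not cfg_accepts(rules, 'S', 'aba')
```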
To prove the logical equivalence, we also need to ensure that PDAs imply a CFG (so now the other way around). This can be done by viewing the stack against time in a 'graph-like' manner, so we can intuitively see the following:
This is another method highlighting how the stack mimics the properties of grammar through internal stacks as a way to create rules
The same as a regular PDA, however determinism is employed. This is done by two main rules:
If this weren't the case, non-determinism could occur, because if the top stack symbol were \(g\), the program could take either the \(\lambda\) path or the \(s\) path, and non-determinism is not what we want
DPDAs are defined by a 6-tuple \( M = ( Q , \Sigma , \Gamma , \delta , q_{0} , F) \) where:
It turns out that NPDAs are more powerful than DPDAs
Language that can be recognized by some DPDA
Deriving a word from a grammar can be thought of as a tree stemming from \(S\) (the start variable), deriving words using combinations of the available rules \(r\in R\). A top-down parser aims to build this tree starting from \(S\) to verify whether the word satisfies the grammar
A specific top-down parser that works by using a rule on the leftmost variable and aims to first resolve the leftmost character of the desired word
In some cases, LL(1) can't resolve a DCFL and so a parsing table should be used to help guide which rules can be used to get which results. To construct such a table, the idea of FIRST and FOLLOW sets must be explored
The FIRST set of a variable or terminal represents the set of all terminals that the variable can begin with once rules are applied. In the case of the FIRST set of some terminal, it is simply the terminal itself, since terminals are in fact terminal and can't derive any other string.
The FOLLOW set of a variable represents the set of all terminals that can follow the end of that variable when rules are applied.
After creating these sets for each variable, create a table with a column per terminal and a row per variable. Then in each data cell, place the rule that is responsible for putting that terminal in the variable's FIRST set. If the terminal in question is the empty string, then focus on the FOLLOW set instead; a sketch of computing FIRST sets follows below
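A sketch of computing FIRST sets by fixed-point iteration; the grammar encoding (uppercase variables, `''` for the empty string) is our own assumption.

```python
def first_sets(rules):
    first = {v: set() for v in rules}
    changed = True
    while changed:                      # iterate until a fixed point
        changed = False
        for v, rhss in rules.items():
            for rhs in rhss:
                if rhs == '':           # lambda rule: '' enters FIRST(v)
                    new = {''}
                else:
                    new, nullable = set(), True
                    for sym in rhs:
                        if sym.isupper():           # variable
                            new |= first[sym] - {''}
                            if '' not in first[sym]:
                                nullable = False
                                break               # prefix can't vanish
                        else:                       # terminal
                            new.add(sym)
                            nullable = False
                            break
                    if nullable:
                        new.add('')     # the whole RHS can derive lambda
                if not new <= first[v]:
                    first[v] |= new
                    changed = True
    return first

# Example: S -> aSb | lambda gives FIRST(S) = {a, lambda}.
assert first_sets({'S': ['aSb', '']})['S'] == {'a', ''}
```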
Regular languages have the characteristic that a large enough string (or more accurately, when \(|w| \gt |Q|\)) is guaranteed to reuse some state. Context-free languages have the characteristic that a large enough string is guaranteed to reuse the same variable
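The corresponding worked example for context-free languages (a textbook instance, not from these notes): \(L = \{a^{n}b^{n}c^{n} : n \geq 0\}\) is not context-free. For \(w = a^{P}b^{P}c^{P}\), any decomposition \(w = uvxyz\) with \(|vxy| \leq P\) and \(|vy| > 0\) has \(vxy\) touching at most two of the three letter blocks, so pumping to \(uv^{2}xy^{2}z\) unbalances the counts and leaves \(L\).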
A theoretical machine devised by Alan Turing with the following aspects:
TMs are defined by a 7-tuple \( M = ( Q , \Sigma , \Gamma , \delta , q_{start} , q_{accept} , q_{reject} ) \) where:
TMs either halt in an accept state, halt in a reject state, or never halt.
Property of a language such that some Turing machine halts on an accept state only when the word is in the language
\(L \text{ is recognizable } \iff \exists M : (M \text{ halts on } q_{accept} \iff w \in L ) \)
Property of a language such that some Turing machine can partition the language using the \(q_{accept},q_{reject}\), that is, all words in the language halt on an accept state and otherwise halt on a reject state
\(L \text{ is decidable } \iff L \text{ is recognizable } \land \exists M : (M \text{ halts on } q_{reject} \iff w \notin L )\)
A model of computation is Turing complete iff it can perform any algorithm that a Turing machine can. The Lambda calculus is an example of such a model of computation.
An algorithm is computable if and only if a Turing machine can compute said algorithm
Whether a Turing machine, given enough tape and time, can compute something
An array of units of information
Instructions for data handling. Programs themselves can be represented as data
Since programs can be represented as data to be parsed by other programs, Turing machines can be represented as strings to be parsed by other Turing machines. We denote \(\Lambda = \{ 0,1 \}\) as the encoding alphabet of the Turing machine. We can do this because no matter the cardinality of the original alphabet, we can always represent each letter as \( \lceil \log_{2}(|\Sigma|) \rceil\) bits concatenated, in the same way that ASCII works. We need to define this standard of encoding ourselves since no canonical standard exists
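A sketch of such a fixed-width encoding in Python; the function and the 3-symbol example are our own, since as noted the convention is up to us.

```python
from math import ceil, log2

def encode(word, alphabet):
    # Each symbol becomes ceil(log2 |alphabet|) bits, ASCII-style.
    width = max(1, ceil(log2(len(alphabet))))
    codes = {sym: format(i, f'0{width}b')
             for i, sym in enumerate(sorted(alphabet))}
    return ''.join(codes[sym] for sym in word)

# A 3-symbol alphabet needs ceil(log2 3) = 2 bits per symbol:
# a -> 00, b -> 01, c -> 10.
assert encode('cab', {'a', 'b', 'c'}) == '100001'
```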
The idea of a Turing machine \(U\) that can take another Turing machine \(M\) as input along with an input string \(w\) and recognise the same language as \(M\): \(L(U) = \{(M,w) : M \text{ is a Turing machine }\land w \in L(M)\} \)
Set-theoretical paradox that arises from applying a partitioning class to a set that by definition partitions some class
Consider a barber that cuts hair for all people that don't cut their own hair. Does the barber cut his own hair? Whether he does or doesn't, both lead to a contradiction
There exist languages that no Turing machine can recognize, simply due to the existence of only \(\aleph_0\) (countably many) Turing machines, yet uncountably many (\(2^{\aleph_0}\)) languages
Proving that unrecognizable languages exist simply requires demonstrating the inability to surjectively map each Turing machine to a language.
Let \(TM\) be the set of all Turing machines and \(L\) be the set of languages, note that by Turing machine encoding, \(TM\) is countable.
Before beginning the proof, a necessary lemma (Cantor's theorem) is that a set never surjects onto its powerset. To prove this, by WOC, let \(f : S \to 2^{S}\) be a surjective function mapping a set to its powerset and define \(U = \{s \in S : s \notin f(s)\} \in 2^{S}\). The assumption of surjectivity implies that \(\exists s \in S : f(s)=U\). Now apply Russell's paradox to finish the proof of the lemma: \(s \in U \iff s \notin f(s) = U\), a contradiction
\(L\) contains all sets of words over \(\{0,1\}\) (i.e. all subsets of \(\{0,1\}^{*}\)), among which are the elements of \(2^{TM}\). By the previous lemma, \(TM\) has no surjective function onto \(2^{TM} \subseteq L\), and hence there exist languages with no associated Turing machine, completing the proof.
There exist languages that no Turing machine can decide. Since all decidable languages are recognizable, this is a simple corollary of the existence of unrecognizable languages. Moreover, there are languages that are recognizable but undecidable, a notable example being the \(A_{TM}\) language: the language of all Turing machine-word pairs that accept
Proving undecidability requires demonstrating a contradictory counterexample by assuming a specific language to be decidable only to reveal paradoxes (this screams Russell's paradox).
By WOC, let \( A_{TM} \) be decidable by \(H(M,w)\). Consider \(H'(M)\) that accepts when \(H(M,M)\) rejects and rejects when \(H(M,M)\) accepts. Now apply Russell's paradox to complete the proof: \(H'(H')\) accepts iff \(H(H',H')\) rejects, i.e. iff \(H'(H')\) rejects, a contradiction
Therefore because of the assumption of deciding \(A_{TM}\), paradoxes occur and hence \(A_{TM}\) is an undecidable language
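The shape of this diagonal argument can be mirrored in code; the sketch below is purely illustrative, since the decider `h` is hypothetical and cannot actually be implemented, and all names are ours.

```python
def h(program, word):
    ...  # hypothetical decider for A_TM: True iff `program` accepts `word`

def h_prime(program):
    # Accept iff h(program, program) rejects, and vice versa.
    return not h(program, program)

# The paradox: h_prime(h_prime) returns True iff h(h_prime, h_prime) is
# False, i.e. iff h_prime(h_prime) "rejects" -- so no such h can exist.
```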
By WOC, let \(HALT_{TM}\) be decidable by \(H(M,w)\). Under this assumption, a decider \(H'\) for \(A_{TM}\) can be constructed, but since that language has been proved undecidable, this is a contradiction, implying that \(HALT_{TM}\) is not in fact decidable. This is essentially a mapping reduction
By WOC, let \(E_{TM}\) be decidable by \(H(M)\). Under this assumption, a decider \(H'\) for \(A_{TM}\) can be constructed, but since that language has been proved undecidable, this is a contradiction, implying that \(E_{TM}\) is not in fact decidable. This is essentially a mapping reduction
Let \(H'(M,w)\) create an auxiliary Turing machine \(X(w')\) that rejects if \(w' \neq w\) and otherwise returns the result of \(M(w')\). This auxiliary machine is designed to have a language of either no elements or solely the element \(w\), providing a useful mapping reduction that translates the notion of 'empty language' into 'rejection'. Then return the negation of \(H(X)\), since \(L(X)\) is empty iff \(M\) does not accept \(w\)
\(A \leq_{m} B \iff \exists \text{ computable } f : \Sigma^{*} \to \Sigma^{*} : \forall w, (w \in A \iff f(w) \in B)\)
\(A\) reduces to \(B\) when some computable function (perhaps represented as a Turing machine) maps exactly the words in \(A\) to words in \(B\) (and words outside \(A\) to words outside \(B\)). This is used as a method of constructing proofs for the decidability of languages.
Let \(f\) be the mapping reducible function from \(A\) to \(B\) and \(h\) the function from \(B\) to \(C\). Then \(w \in A \iff h(f(w)) \in C\), hence proving the transitive property of mapping reducibility
Let \( (M,w) \) be a pair in \(A_{TM}\). Use Turing machine \(M'(M,w)\) as part of the mapping reducible function that accepts when \(M(w)\) accepts, and otherwise loops infinitely (it must not reject, as rejection is a type of 'halt'). Since \( (M,w) \in A_{TM} \iff (M'(M),w) \in HALT_{TM} \), then \(A_{TM} \leq_{m} HALT_{TM}\)
Let \( (M,w) \) be a pair in \(HALT_{TM}\). Use Turing machine \(M'(M,w)\) as part of the mapping reducible function that accepts when \(M(w)\) halts, and otherwise loops infinitely (it cannot detect non-halting, so it simply never reaches a verdict). Since \( (M,w) \in HALT_{TM} \iff (M'(M),w) \in A_{TM} \), then \(HALT_{TM} \leq_{m} A_{TM}\)
These two reductions are the corollaries necessary to conclude that \( A_{TM} \equiv_{m} HALT_{TM} \)
A characteristic function that partitions the set of Turing machines \(TM\) into two sets. Less formally, it categorizes Turing machines into two groups based on whether the Turing machine meets a specific boolean condition
\(P : \{\text{TM}\} \to \{ 0,1 \}\)
We let \(L_P = \{M : P(M) = 1\} \) be the language of all Turing machines fulfilling some property \(P\)
\(P \text{ is trivial } \iff (\forall M\in TM, P(M) = 1) \lor (\forall M\in TM, P(M) = 0)\)
\(P \text{ is semantic} \iff \forall M_1,M_2, (L(M_1) = L(M_2) \implies P(M_1) = P(M_2) )\)
\(P \text{ is semantic} \land P \text{ is not trivial } \implies L_{P} = \{M \in TM : P(M)=1\} \text{ is undecidable}\)
All non-trivial semantic properties \(P\) have an undecidable \(L_P\)
Since \(A_{TM} \leq_{m} L_{P} \implies L_{P} \text{ is undecidable}\), one wants to show generally that \(P \text{ is semantic} \land P \text{ is not trivial } \implies A_{TM} \leq_{m} L_{P}\)
Proving Rice's theorem requires creating Turing machines within a Turing machine (an auxiliary Turing machine). For this proof the goal is to find some way that the auxiliary TM satisfies the property \(P\) iff \(M \text{ accepts } w\). In this vein, fix some TM \(T\) with \(P(T)=1\) (one exists since \(P\) is non-trivial; assume WLOG that the empty language does not satisfy \(P\)), and let \(X_{M,w}(x)\) be the auxiliary TM which accepts iff \(M \text{ accepts } w \land T \text{ accepts } x\), and otherwise loops.
Note that the language of this auxiliary TM is empty (and so does not satisfy \(P\)) if \(M\) rejects or loops on \(w\), and equals \(L(T)\) (satisfying \(P\)) when \(M \text{ accepts } w\)
By WOC, assume \( L_{P} \) is decided by \(H\), and let \(H'(M,w)\) be a Turing machine that does the following:
This \(H'\) serves as the function \(f\) mapping language \(A_{TM}\) to \(L_{P}\), and since \(A_{TM}\) is undecidable, so is \(L_{P}\)
\(\delta : Q \times \Gamma\to2^{Q \times \Gamma \times \{L,R\}} \)
Since NTMs may take several paths to attempt to invoke an accept, we define acceptance by a history of 'configurations' \(C_0,C_1,C_2,...\)
They have the same power as normal Turing machines, as a normal Turing machine can simulate an NTM by enumerating the configuration histories
Analysis of the resources, such as time and memory, that an algorithm requires
Note that Big-O ignores functional translations and multiplicative constants
For some machine \(M\), it has a time complexity function \(T : \mathbb{N} \to \mathbb{N}\) that gives the maximum number of steps \(M\) takes to execute on an input of length \(n\)
For some time function \(T(n)\), \(\text{TIME}(T(n))\) is the set of all languages decidable by an algorithm in \(O(T(n))\)
The languages that can be resolved with a time complexity \(O(T(n))\)
\( \text{TIME} (T(n)) = \{ L' : \text{There is a Turing machine with time complexity } O(T(n)) \text{ such that } L'=L(M) \}\)
The languages that can be resolved by a nondeterministic Turing machine with a time complexity \(O(T(n))\)
\( \text{NTIME} (T(n)) = \{ L' : \text{There is a Nondeterministic Turing machine with time complexity } O(T(n)) \text{ such that } L'=L(M) \}\)
\( P = \bigcup_{k \in \mathbb{N}} \text{TIME}(n^k)\)
\( NP = \bigcup_{k \in \mathbb{N}} \text{NTIME}(n^k)\)
Thesis stating that the efficiently computable problems are exactly those solvable in polynomial time
\(L \text{ is feasibly computable } \iff L \in P \)
\(L \in NP \iff \exists k \in \mathbb{N},V_{TM}(x,y) \text{ running in polynomial time} : L=\{ x : \exists y \in \Sigma^{*} (|y| \leq |x|^k \land V(x,y) \text{ accepts})\}\)
\(L \in NP \iff \exists k, V_{TM}(x,y) : L=\{ x : \exists y \in \Sigma^{*} (|y| \leq |x|^k \land V(x,y) \text{ accepts})\} \land L(V_{TM}) \in P\)
\(M(w)\) accepts in time \(T(n)\) if and only if there exists a history of configurations that accepts
Language \(A\) is poly-time reducible to \(B\) if there is a poly-time computable function that maps exactly the elements of \(A\) to elements of \(B\) (membership is preserved in both directions)
\(A \leq_{P} B \iff \exists \text{ poly-time computable } f : \Sigma^{*} \to \Sigma^{*} : \forall w, (w \in A \iff f(w) \in B)\)
\( L \in NP \land (\forall B \in NP, B \leq_P L) \iff L \in \text{ NP-Complete }\)
SAT is an NP-Complete language
\( (\forall B \in NP, B \leq_P L) \iff L \in \text{ NP-Hard }\)
SAT is NP-Complete (so all NP problems can be polynomial-time reduced to SAT). As a corollary, \(\text{SAT} \in P \iff P=NP \)
Function \(\varphi : \{0,1\}^n \to \{0,1\} \) defined on boolean inputs. The logical operators AND, OR, NOT, XOR, XNOR, NAND etc. are examples of boolean functions
Language of all satisfiable boolean formulae
\(\text{SAT} = \{ \varphi : \varphi \text{ is a satisfiable boolean formula } \} \)
It is in NP since verifying that a string is in SAT, given a satisfying assignment as the certificate, runs in polynomial time: simply evaluate the formula
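A sketch of such a verifier in Python, with the certificate \(y\) being a candidate assignment; the integer-based CNF encoding (negative integers for negated literals) is our own convention.

```python
def verify_sat(cnf, assignment):
    """cnf: list of clauses, each a list of nonzero ints (negative = negated
    literal); assignment: dict mapping variable number -> bool.
    One pass over the clauses, so clearly polynomial time."""
    return all(
        any(assignment[abs(lit)] == (lit > 0) for lit in clause)
        for clause in cnf
    )

# (x1 or not x2) and (x2 or x3), with x1=T, x2=T, x3=F: satisfied.
cnf = [[1, -2], [2, 3]]
assert verify_sat(cnf, {1: True, 2: True, 3: False})
assert not verify_sat(cnf, {1: False, 2: True, 3: False})
```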
It is also NP-Hard since one can devise a boolean formula that is logically equivalent to a TM running in NTIME, that is, the formula returns true exactly when passed an accepting configuration history. Consider some tableau holding a full computation history (this will be of size \(n^{2k}\) cells, since a run takes at most \(n^k\) steps and any TM that runs in polynomial time can only access \(n^k\) tape cells)
\( \varphi = \varphi_{\text{cell}} \land \varphi_{\text{start}} \land \varphi_{\text{accept}} \land \varphi_{\text{move}} \)
\( \varphi_{\text{cell}} = \bigwedge_{1 \leq i,j \leq n^k} [ (\bigvee_{s \in C} x_{i,j,s}) \land (\bigwedge_{s,t \in C,s\neq t} ( \neg x_{i,j,s} \lor \neg x_{i,j,t} ) ) ] \)
Conjunctive Normal Form; the AND operator acting on a number of clauses. In an \(n\)CNF-formula, each clause contains at most \(n\) literals
\(\bigwedge_{k=0} c_k\)
The OR operator acting on a number of literals
\(c = \bigvee_{k=0} x_k\)
A boolean variable or negation of a boolean variable
\(x\)
\(\text{3SAT} = \{ \varphi : \varphi \text{ is a satisfiable 3CNF-formula } \} \)
3SAT is also NP-Complete, and by reducing 3SAT to other problems we can prove those problems to be NP-Complete too, in a similar way to how reducing \(A_{TM}\) to a language shows that language undecidable. You can, for instance, reduce 3SAT to a logically equivalent graph problem to prove that the Hamiltonian path problem's language is NP-Complete
A system of proving that some statement is true without disclosing anything beyond the statement's truth (such as the witness itself)
A valid lambda calculus program; \(\Lambda\) represents the set of all expressions.
An element that represents a value liable to change. There are two types:
\(x\)
An element binding a variable \(x\) within a lambda expression \(M\); the scope of \(x\) is \(M\)
\(\lambda x.M\)
An element passing one lambda expression to another
\( (M N) \)
Using this knowledge, \(\Lambda\) can be defined as a grammar such that an expression is a sequence of 'abstractions' and 'applications' of variables
The notion that bound variable names can be changed so long as it is within scope and no clashes occur
\( \lambda x.M[x] \equiv_{\alpha} \lambda y.M[y] \)
The notion that an expression can be applied to another expression as substitution into the bound variable
\( (\lambda x.M[x])y \equiv_{\beta} M[x := y] \)
The notion that an abstraction that applies its input to some internal expression is logically equivalent to that internal expression itself
\( \lambda x.(M x) \equiv_{\eta} M \) (provided \(x\) does not occur free in \(M\))
Multi-parameter function syntax achieved by nesting functions inside each other
\(\lambda x.\lambda y.xy\)
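The same idea transliterated into Python lambdas; the `add` helper is our own example.

```python
apply_fn = lambda x: lambda y: x(y)     # the notes' λx.λy.xy
add = lambda a: lambda b: a + b         # a curried two-parameter function
assert add(2)(3) == 5
assert apply_fn(add(1))(4) == 5         # x := add(1), y := 4
```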
\(\bar{n}=\lambda s.\lambda z.s^{n}(z) \) where:
\(S = \lambda n.\lambda s.\lambda z.\,s(n\,s\,z) \implies S\overline{n} = \overline{n+1}\)
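A quick Python transliteration of Church numerals and the successor; the names mirror \(s\) and \(z\) above, and `to_int` is our own helper for inspection.

```python
def church(n):
    # n-bar = λs.λz. s applied n times to z
    return lambda s: lambda z: z if n == 0 else s(church(n - 1)(s)(z))

successor = lambda n: lambda s: lambda z: s(n(s)(z))   # S from above

def to_int(c):
    return c(lambda k: k + 1)(0)   # interpret s as +1 and z as 0

assert to_int(church(3)) == 3
assert to_int(successor(church(3))) == 4               # S n-bar = (n+1)-bar
```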
A symbolic element
An element representing the medium through which a message travels
\(c\)
An element whose value may be transmitted and received on a channel
\(x\)
Construct that permits two processes running simultaneously (concurrently)
\(P|Q\)
Construct that awaits a message on some channel; this message is then bound to some name
\(c(x)\)
Construct that transmits a message on some channel, this message is then received by a concurrent process
\(\overline{c}\langle y \rangle\)
Construct that can create new instances of a defined process
\(!P\)
Construct that creates a fresh name \(x\), restricted to be private within a process
\((\nu x)\)
Construct that terminates a process
\(0\)
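These constructs can be loosely mirrored with `asyncio` in Python; the mapping (a `Queue` as the channel \(c\), `gather` as parallel composition \(P|Q\)) is our own illustrative analogy, not a formal encoding.

```python
import asyncio

async def sender(c):
    await c.put('hello')          # c<y>: transmit a message on channel c

async def receiver(c):
    x = await c.get()             # c(x): await a message, bind it to x
    print('received', x)

async def main():
    c = asyncio.Queue()           # (nu c): a fresh, private channel
    await asyncio.gather(sender(c), receiver(c))   # P | Q

asyncio.run(main())
```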