Julian Gonzalez

This homework covers material from Lecture 5: Foundational Theories of Computation and Lecture 6: Classifying Programming Languages. You will explore the four pillars of computer science, discover new computational models, reason about the limits of computation, and analyze how pragmatic design choices shape programming languages.

All questions require written responses. Show your work where applicable, and provide clear justifications for your answers. Several questions require independent research — cite your sources. Partial credit will be awarded based on the quality of your reasoning.

📝Written Section

100 points

Part 1: Language Theory (15 points)

Q1—Exploring Forms of Expression

15 pts

Part 2: Automata Theory (25 points)

In LN5, we studied Turing Machines — the most powerful computational model we know. But Turing Machines are complex: an infinite tape, a read/write head, arbitrary movement. What happens when we strip away that complexity?

In Part 2, you will discover two simpler classes of automata by researching them independently, building examples, and exploring their capabilities and limits. By the end, you will construct the complete hierarchy of computational power — connecting machines, languages, and decidability.

Q2—Guided Discovery — Finite Automata and Their Limits

12 pts

Q3—Guided Discovery — Pushdown Automata and the Machine Hierarchy

13 pts

Part 3: Computability Theory (25 points)

In LN5, we learned that most functions are not computable — there are more functions than programs. The proof uses the same diagonal argument from LN3's set theory. In Part 3, you will work through this argument yourself and connect the machine hierarchy you built in Part 2 to the fundamental limits of computation.

Q4—The Diagonal Argument — A Guided Proof

15 pts

Q5—Machine Capabilities Across the Hierarchy

10 pts

Q5a 4 pts In Q2 you built a DFA for binary parity, and in Q3 you built a PDA for balanced parentheses. Consider these three languages: Binary strings with an even number of 1s Balanced parentheses Strings of the form for For each language, state the weakest machine from the hierarchy you built in Q3d that can recognize it. Then explain why the next weaker machine cannot — what specific capability does it lack? Q5b 3 pts The Halting Problem (LN5) defines a language: the set of all (program, input) pairs where the program eventually halts. Where does this language sit relative to your hierarchy from Q3d? Is it at the DFA level, PDA level, TM level, or beyond? Can any machine in the hierarchy decide it — that is, always halt with the correct yes/no answer? Can a Turing Machine recognize it — that is, halt and accept if the answer is yes, but possibly loop forever if no? Explain the difference between deciding and recognizing. Q5c 3 pts In LN4, we showed that the Simply Typed Lambda Calculus always terminates — every well-typed expression has a normal form (strong normalization). This means its set of expressible computations is a proper subset of what a Turing Machine can do. Where on your hierarchy would you place the Simply Typed Lambda Calculus? Is it more or less powerful than a DFA? Than a PDA? Than a full TM? What does strong normalization buy you that a Turing-complete language cannot offer? (Hint: think about what the Halting Problem says about Turing-complete systems.) Connect this to the expressiveness-guarantees trade-off from LN5: why might a language designer intentionally choose to sacrifice Turing completeness?

Part 4: Complexity Theory (15 points)

Q6—Reducibility and Formalizing Complexity

15 pts

Q6a3 ptsDefining ReductionIn complexity theory, a polynomial-time reduction from problem to problem (written ) shows that is "no harder than" .Using the set theory and logic from LN2 and LN3: Define what means formally. What properties must the reduction function have? If and is in P, what can you conclude about ? Why? If and is NP-hard, what can you conclude about ? Why? Q6b5 ptsA Guided Reduction: Building Structural Transformation IntuitionConsider two classic NP-complete problems on graphs: Independent Set: Given a graph and an integer , does there exist a set of size such that no two vertices in are connected by an edge? Vertex Cover: Given a graph and an integer , does there exist a set of size such that every edge in has at least one endpoint in ? Consider the following graph with 5 vertices and 6 edges:Loading graph... Prove that is an independent set of size if and only if is a vertex cover of size . (Hint: think about what happens to an edge when one endpoint is in S versus not in S.) Construct the mapping function that transforms an Independent Set instance into a Vertex Cover instance. What does do to the graph? What does it do to ? Verify on the graph above: find an independent set of size 2 and confirm that its complement is a vertex cover of size 3. Looking ahead: Notice how a systematic structural transformation showed that two problems that look different are actually the same in disguise. This is the same skill you will use throughout this course: parsing transforms strings into trees, trimming a Concrete Syntax Tree to an Abstract Syntax Tree removes redundant structure while preserving meaning, and evaluating an AST into a semantic value transforms structure into computation. Each of these is a reduction — a systematic mapping between representations that preserves what matters. Q6c3 ptsFormalizing Complexity with Our ToolsThe formal tools from LN2 (logic) and LN3 (set theory) are exactly what we use to define complexity classes precisely. In this problem, we give you the formal components — your job is to explain what they mean and assemble them correctly.1. The formal definition of Big-O is:$$O(g(n)) = \{ f : \mathbb{N} \to \mathbb{N} \mid \exists c \in \mathbb{N}.\; \exists n_0 \in \mathbb{N}.\; \forall n \geq n_0.\; f(n) \leq c \cdot g(n) \}$$This single definition uses three of our formal tools simultaneously. Identify each one and explain its role: Set theory: The outer curly braces define a set. A set of what? What does the vertical bar ("such that") separate? Logical quantifiers: There are two existential and one universal quantifier. In plain English, what does each quantifier "do" in this definition? Why is the order (existential before universal) important — what would change if the universal came first? Functions: The elements of the set are themselves functions from the naturals to the naturals. How does this connect to the set-theoretic function concept from LN3? 2. Now define the complexity class P. We give you the building blocks — assemble them into a single formal definition using set-builder notation: Building blocks: decision problem L, Turing Machine M, decides (halts with correct answer on every input), polynomial p(n), time steps, input length Skeleton: $$\mathbf{P} = \{ L \mid \exists \_\_\_.\; \exists \_\_\_.\; \forall \_\_\_.\; \_\_\_ \}$$Fill in each blank and explain your choices. (Hint: you need to say there exists a machine and a polynomial such that for every input, the machine decides correctly within the polynomial bound.)3. Define NP similarly. The key difference is that NP uses a verifier instead of a decider: Additional building block: certificate c (a "proof" that the answer is yes), the certificate is polynomial-length Skeleton: $$\mathbf{NP} = \{ L \mid \exists \_\_\_.\; \exists \_\_\_.\; \forall \_\_\_.\; (\_\_\_ \Leftrightarrow \exists \_\_\_.\; \_\_\_) \}$$Fill in the blanks. Then explain: why does NP have an extra existential quantifier that P does not? What is being "guessed" versus "checked"?Q6d4 ptsComplexity AnalysisConsider this Python function:def find_pair(items, target): for i in range(len(items)): for j in range(i + 1, len(items)): if items[i] + items[j] == target: return (i, j) return None 1. Formally prove the worst-case time complexity of find_pair using the Big-O definition from Q6c. We break this into guided steps: Step A — Count the work: In the worst case (no pair found), the outer loop runs n times. For each iteration i, the inner loop runs n - i - 1 times. Write the total number of comparisons as a summation: $$f(n) = \sum_{i=0}^{n-1} \_\_\_$$ Step B — Simplify: Evaluate the summation. (Hint: recall the identity below) $$\sum_{k=0}^{m} k = \frac{m(m+1)}{2}$$ Step C — Apply the definition: Choose g(n) = n². Using your simplified f(n), find specific values of c and n₀ such that f(n) ≤ c · g(n) for all n ≥ n₀. Write your answer in this form: $$\text{Let } c = \_\_\_ \text{ and } n_0 = \_\_\_. \text{ Then for all } n \geq n_0: \; f(n) = \_\_\_ \leq \_\_\_ = c \cdot n^2$$2. Is the problem solved by find_pair in P? Use your definition of P from Q6c. (Hint: you just proved its time complexity — does it fit within a polynomial bound? What would the polynomial be?)3. Is it in NP? Describe a polynomial-time verifier: given a proposed solution (a pair of indices ), what does the verifier check, and how many steps does it take? (Hint: a verifier does not need to search — it only confirms a given answer.)

Part 5: Language Pragmatics (20 points)

In LN6, we explored how programming languages force programmers to think at different levels of abstraction — and how choosing the right tool is a pragmatic decision about matching the level of concern to the problem domain. In Part 5, you will research real languages and apply the pragmatic principles from lecture.

Q7—Paradigmatic Friction

12 pts

For each language pair below, research both languages and perform a comparative pragmatic analysis. You should look at documentation, tutorials, and example programs in each language. Cite your sources. Q7a 6 pts Functional Pair: Erlang vs. Haskell Both Erlang and Haskell are functional programming languages, but they were designed for very different problem domains and make very different pragmatic trade-offs. Identify 3 pragmatic concerns that both languages share (e.g., both emphasize immutability, both use pattern matching, both favor recursion over loops). Identify 3 pragmatic concerns where they diverge (e.g., Erlang's concurrency model vs. Haskell's, purity enforcement, type system design, runtime philosophy, error handling). Describe a specific task that is natural in one language but awkward in the other. Explain what pragmatic design decision creates this friction. Referring to the 8 building blocks from LN6 (names/binding/scope, evaluation, control flow, types, functions, modules, concurrency, metaprogramming), identify which building block differs most between Erlang and Haskell and explain how. Q7b 6 pts Systems Pair: Rust vs. C++ Both Rust and C++ are systems programming languages that give the programmer fine-grained control over memory and performance. But they take fundamentally different approaches to safety and correctness. Perform the same analysis as Q7a: 3 shared pragmatic concerns 3 divergent pragmatic concerns A specific task natural in one but awkward in the other, with explanation of the friction Which of the 8 building blocks differs most, and how?

Q8—Pragmatic Principles in the Wild

8 pts

Submit to Brightspace

Submission Guidelines

Submit your responses as a single PDF to Brightspace
Clearly label each question and subquestion (Q1a, Q1b, etc.)
For state diagrams and automata traces, draw them clearly (digital or neatly hand-drawn)
For the guided proof (Q4), show each step explicitly — do not skip reasoning
For research questions (Q1d, Q2a, Q3a, Q7, Q8c), cite your sources
Show your reasoning — partial credit is awarded for correct methodology

Point Summary

Section	Question	Points
Part 1: Language Theory	Q1: Exploring Forms of Expression	15
Part 2: Automata Theory	Q2: Guided Discovery — Finite Automata	12
	Q3: Guided Discovery — PDAs + Hierarchy	13
Part 3: Computability Theory	Q4: The Diagonal Argument (Guided Proof)	15
	Q5: Machine Capabilities Across the Hierarchy	10
Part 4: Complexity Theory	Q6: Reducibility and Formalizing Complexity	15
Part 5: Language Pragmatics	Q7: Paradigmatic Friction (Research)	12
	Q8: Pragmatic Principles in the Wild	8
Total		100

⭐Optional Section

20 points

Building the Validator — Decidability by Design

This is Stage 3 of the WebAssembly project. In HW3, you reasoned about what machines can and cannot decide — the halting problem, the Chomsky hierarchy, and the trade-off between expressiveness and guarantees. In this optional, you will implement a decidable validation algorithm for WebAssembly and understand why it is decidable.

Wasm was deliberately designed so that validation (type-checking an entire module) runs in linear time. This is not an accident — it is a consequence of design decisions that restrict Wasm's control flow in ways that map directly to the Chomsky hierarchy you studied. You will implement the validator and then reason about what makes it work.

If you completed the HW1 optional (the type checker), you may use your own implementation here. Otherwise, the scaffolding below includes a reference type checker you can build on.

Spec Sections to Read

Validation → Modules — How entire modules are validated: checking function types, verifying imports/exports, validating function bodies.
Appendix → Validation Algorithm — This is the key reading. The spec provides a complete validation algorithm and proves it is equivalent to the declarative typing rules from Stage 1. The algorithm runs in linear time on the instruction sequence. Read this and understand why linear-time validation is possible.
Introduction → Overview → Semantic Phases — Re-read the "Validation" paragraph: "Validation checks a number of well-formedness conditions to guarantee that the module is meaningful and safe."

Your Task

Implement a function validate(module) that takes a Wasm module (as a structured object) and returns either valid or a list of validation errors. Your validator should check:

Type consistency: Every function body must type-check against its declared signature (using your Stage 1 type checker or the reference implementation below).
Index validity: Every local.get x references an index that exists in the function's local declarations; every call x references a valid function index.
Stack balance: Every block and if must have a matching end; the operand stack must be balanced at every join point.

You may use any programming language.

Scaffolding

Module structure — the input to your validator:

Module = {
  types: functype[]                    -- type declarations
  funcs: Function[]                    -- function definitions
}

Function = {
  typeIndex: int                       -- index into Module.types
  locals: valtype[]                    -- declared local variables (params + locals)
  body: Instruction[]                  -- instruction sequence
}

Reference type checker (if you did not complete Stage 1):

typecheck(instructions, context) -> instrtype | error

You may assume this function exists and works correctly. Implement it yourself, or use the pseudocode from the HW1 optional scaffolding.

Test modules:

Test 1 — Valid module with two functions:

Module {
  types: [{ params: ["i32", "i32"], results: ["i32"] },
          { params: ["i32"], results: ["i32"] }],
  funcs: [
    { typeIndex: 0, locals: ["i32", "i32"],
      body: [local.get 0, local.get 1, i32.add] },
    { typeIndex: 1, locals: ["i32"],
      body: [local.get 0, local.get 0, i32.mul] }
  ]
}

Expected: valid.

Test 2 — Invalid: function body returns wrong type:

Module {
  types: [{ params: ["i32"], results: ["i32"] }],
  funcs: [
    { typeIndex: 0, locals: ["i32"],
      body: [local.get 0, drop] }
  ]
}

Expected: error — function body produces [] but signature expects ["i32"].

Test 3 — Invalid: out-of-bounds local index:

Module {
  types: [{ params: [], results: ["i32"] }],
  funcs: [
    { typeIndex: 0, locals: [],
      body: [local.get 5] }
  ]
}

Expected: error — local index 5 does not exist.

Test 4 — Valid module with control flow:

Module {
  types: [{ params: ["i32"], results: ["i32"] }],
  funcs: [
    { typeIndex: 0, locals: ["i32"],
      body: [local.get 0, if (result i32) [i32.const 1] else [i32.const 0] end] }
  ]
}

Expected: valid.

Reflection

After implementing the validator, answer these questions (include them as comments in your code or in a separate REFLECTION.md):

Wasm's control flow is structured — no arbitrary goto, only block/loop/if with br targeting labels. How does this restriction relate to the Chomsky hierarchy from HW3? At what level of the hierarchy does Wasm's control flow sit? What would change if Wasm allowed arbitrary jumps?
The validation algorithm runs in time. Why? What property of Wasm's design makes this possible? (Hint: the structured control flow means the validator never needs to backtrack.)
SQL, regex, and CSS were discussed in HW3 as intentionally sub-Turing-complete. Is Wasm Turing-complete? What does Wasm sacrifice (compared to native x86 code) and what does it gain?

Deliverable

Submit your validator source code, verify it handles all 4 test modules correctly, and include your reflection answers.

Loading content...

Part 1: Language Theory (15 points)

Q1—Exploring Forms of Expression

15 pts

Part 2: Automata Theory (25 points)

Q2—Guided Discovery — Finite Automata and Their Limits

12 pts

Q3—Guided Discovery — Pushdown Automata and the Machine Hierarchy

13 pts

Part 3: Computability Theory (25 points)

Q4—The Diagonal Argument — A Guided Proof

15 pts

Q5—Machine Capabilities Across the Hierarchy

10 pts

Q5a 4 pts In Q2 you built a DFA for binary parity, and in Q3 you built a PDA for balanced parentheses. Consider these three languages: Binary strings with an even number of 1s Balanced parentheses Strings of the form for For each language, state the weakest machine from the hierarchy you built in Q3d that can recognize it. Then explain why the next weaker machine cannot — what specific capability does it lack? Q5b 3 pts The Halting Problem (LN5) defines a language: the set of all (program, input) pairs where the program eventually halts. Where does this language sit relative to your hierarchy from Q3d? Is it at the DFA level, PDA level, TM level, or beyond? Can any machine in the hierarchy decide it — that is, always halt with the correct yes/no answer? Can a Turing Machine recognize it — that is, halt and accept if the answer is yes, but possibly loop forever if no? Explain the difference between deciding and recognizing. Q5c 3 pts In LN4, we showed that the Simply Typed Lambda Calculus always terminates — every well-typed expression has a normal form (strong normalization). This means its set of expressible computations is a proper subset of what a Turing Machine can do. Where on your hierarchy would you place the Simply Typed Lambda Calculus? Is it more or less powerful than a DFA? Than a PDA? Than a full TM? What does strong normalization buy you that a Turing-complete language cannot offer? (Hint: think about what the Halting Problem says about Turing-complete systems.) Connect this to the expressiveness-guarantees trade-off from LN5: why might a language designer intentionally choose to sacrifice Turing completeness?

Part 4: Complexity Theory (15 points)

Q6—Reducibility and Formalizing Complexity

15 pts

Q6a3 ptsDefining ReductionIn complexity theory, a polynomial-time reduction from problem to problem (written ) shows that is "no harder than" .Using the set theory and logic from LN2 and LN3: Define what means formally. What properties must the reduction function have? If and is in P, what can you conclude about ? Why? If and is NP-hard, what can you conclude about ? Why? Q6b5 ptsA Guided Reduction: Building Structural Transformation IntuitionConsider two classic NP-complete problems on graphs: Independent Set: Given a graph and an integer , does there exist a set of size such that no two vertices in are connected by an edge? Vertex Cover: Given a graph and an integer , does there exist a set of size such that every edge in has at least one endpoint in ? Consider the following graph with 5 vertices and 6 edges:Loading graph... Prove that is an independent set of size if and only if is a vertex cover of size . (Hint: think about what happens to an edge when one endpoint is in S versus not in S.) Construct the mapping function that transforms an Independent Set instance into a Vertex Cover instance. What does do to the graph? What does it do to ? Verify on the graph above: find an independent set of size 2 and confirm that its complement is a vertex cover of size 3. Looking ahead: Notice how a systematic structural transformation showed that two problems that look different are actually the same in disguise. This is the same skill you will use throughout this course: parsing transforms strings into trees, trimming a Concrete Syntax Tree to an Abstract Syntax Tree removes redundant structure while preserving meaning, and evaluating an AST into a semantic value transforms structure into computation. Each of these is a reduction — a systematic mapping between representations that preserves what matters. Q6c3 ptsFormalizing Complexity with Our ToolsThe formal tools from LN2 (logic) and LN3 (set theory) are exactly what we use to define complexity classes precisely. In this problem, we give you the formal components — your job is to explain what they mean and assemble them correctly.1. The formal definition of Big-O is:$$O(g(n)) = \{ f : \mathbb{N} \to \mathbb{N} \mid \exists c \in \mathbb{N}.\; \exists n_0 \in \mathbb{N}.\; \forall n \geq n_0.\; f(n) \leq c \cdot g(n) \}$$This single definition uses three of our formal tools simultaneously. Identify each one and explain its role: Set theory: The outer curly braces define a set. A set of what? What does the vertical bar ("such that") separate? Logical quantifiers: There are two existential and one universal quantifier. In plain English, what does each quantifier "do" in this definition? Why is the order (existential before universal) important — what would change if the universal came first? Functions: The elements of the set are themselves functions from the naturals to the naturals. How does this connect to the set-theoretic function concept from LN3? 2. Now define the complexity class P. We give you the building blocks — assemble them into a single formal definition using set-builder notation: Building blocks: decision problem L, Turing Machine M, decides (halts with correct answer on every input), polynomial p(n), time steps, input length Skeleton: $$\mathbf{P} = \{ L \mid \exists \_\_\_.\; \exists \_\_\_.\; \forall \_\_\_.\; \_\_\_ \}$$Fill in each blank and explain your choices. (Hint: you need to say there exists a machine and a polynomial such that for every input, the machine decides correctly within the polynomial bound.)3. Define NP similarly. The key difference is that NP uses a verifier instead of a decider: Additional building block: certificate c (a "proof" that the answer is yes), the certificate is polynomial-length Skeleton: $$\mathbf{NP} = \{ L \mid \exists \_\_\_.\; \exists \_\_\_.\; \forall \_\_\_.\; (\_\_\_ \Leftrightarrow \exists \_\_\_.\; \_\_\_) \}$$Fill in the blanks. Then explain: why does NP have an extra existential quantifier that P does not? What is being "guessed" versus "checked"?Q6d4 ptsComplexity AnalysisConsider this Python function:def find_pair(items, target): for i in range(len(items)): for j in range(i + 1, len(items)): if items[i] + items[j] == target: return (i, j) return None 1. Formally prove the worst-case time complexity of find_pair using the Big-O definition from Q6c. We break this into guided steps: Step A — Count the work: In the worst case (no pair found), the outer loop runs n times. For each iteration i, the inner loop runs n - i - 1 times. Write the total number of comparisons as a summation: $$f(n) = \sum_{i=0}^{n-1} \_\_\_$$ Step B — Simplify: Evaluate the summation. (Hint: recall the identity below) $$\sum_{k=0}^{m} k = \frac{m(m+1)}{2}$$ Step C — Apply the definition: Choose g(n) = n². Using your simplified f(n), find specific values of c and n₀ such that f(n) ≤ c · g(n) for all n ≥ n₀. Write your answer in this form: $$\text{Let } c = \_\_\_ \text{ and } n_0 = \_\_\_. \text{ Then for all } n \geq n_0: \; f(n) = \_\_\_ \leq \_\_\_ = c \cdot n^2$$2. Is the problem solved by find_pair in P? Use your definition of P from Q6c. (Hint: you just proved its time complexity — does it fit within a polynomial bound? What would the polynomial be?)3. Is it in NP? Describe a polynomial-time verifier: given a proposed solution (a pair of indices ), what does the verifier check, and how many steps does it take? (Hint: a verifier does not need to search — it only confirms a given answer.)

Part 5: Language Pragmatics (20 points)

Q7—Paradigmatic Friction

12 pts

For each language pair below, research both languages and perform a comparative pragmatic analysis. You should look at documentation, tutorials, and example programs in each language. Cite your sources. Q7a 6 pts Functional Pair: Erlang vs. Haskell Both Erlang and Haskell are functional programming languages, but they were designed for very different problem domains and make very different pragmatic trade-offs. Identify 3 pragmatic concerns that both languages share (e.g., both emphasize immutability, both use pattern matching, both favor recursion over loops). Identify 3 pragmatic concerns where they diverge (e.g., Erlang's concurrency model vs. Haskell's, purity enforcement, type system design, runtime philosophy, error handling). Describe a specific task that is natural in one language but awkward in the other. Explain what pragmatic design decision creates this friction. Referring to the 8 building blocks from LN6 (names/binding/scope, evaluation, control flow, types, functions, modules, concurrency, metaprogramming), identify which building block differs most between Erlang and Haskell and explain how. Q7b 6 pts Systems Pair: Rust vs. C++ Both Rust and C++ are systems programming languages that give the programmer fine-grained control over memory and performance. But they take fundamentally different approaches to safety and correctness. Perform the same analysis as Q7a: 3 shared pragmatic concerns 3 divergent pragmatic concerns A specific task natural in one but awkward in the other, with explanation of the friction Which of the 8 building blocks differs most, and how?

Q8—Pragmatic Principles in the Wild

8 pts

Course Planner

Final Exam Release

HW 6

Final Exam Due

HW 3

Topics Covered:

📝Written Section

Part 1: Language Theory (15 points)

Part 2: Automata Theory (25 points)

Part 3: Computability Theory (25 points)

Part 4: Complexity Theory (15 points)

Part 5: Language Pragmatics (20 points)

Submission Guidelines

Point Summary

⭐Optional Section

Building the Validator — Decidability by Design

Spec Sections to Read

Your Task

Scaffolding

Reflection

Deliverable

All Homeworks

📝Written Section

Part 1: Language Theory (15 points)

Part 2: Automata Theory (25 points)

Part 3: Computability Theory (25 points)

Part 4: Complexity Theory (15 points)

Part 5: Language Pragmatics (20 points)

Submission Guidelines

Point Summary

⭐Optional Section

Building the Validator — Decidability by Design

Spec Sections to Read

Your Task

Scaffolding

Reflection

Deliverable