
How to

This tutorial consists of two basic parts: background information on knowledge space theory (KST), and interactive Shiny apps that illustrate individual concepts of the theory.

For a refresher, read through the texts about KST. Interactive apps are highlighted with dotted light gray borders. In these, you can usually visualize, calculate or otherwise interact with one or more of the previously mentioned aspects.

Give it a try ;)





Acknowledgement

TquanT QHELP

The various apps were developed by participants as part of the TquanT and QHELP programs and were often extended by Cord Hockemeyer. For a more consistent tutorial experience, they were adjusted and redundant content was removed. For more information, the original apps, and further background, please have a look at the linked web pages.

The texts are partly taken from a talk by Jürgen Heller and Florian Wickelmaier, which was presented during TquanT.

Many thanks to all the students who developed the apps as part of the QHELP and TquanT mobility events and Alice Maurer for helpful comments and corrections.

If you want to learn more exciting concepts of KST, check out the pages of TquanT and QHELP. There you will find more student-developed apps related to KST.

Deterministic KST

Basic terms

The basis is the characterisation of a knowledge domain through tasks that test knowledge in a specific area. The tasks are dichotomous, i.e. they can be considered either solved or unsolved.

Formally, a knowledge domain is defined by a non-empty set \(Q\) of tasks.

E.g.: \(Q = \{a, b, c, d, e, f\}\)

The knowledge state of a person is represented by the subset \(K\) of tasks from \(Q\) that the person can solve, \(K \subseteq Q\). For example, if a person is able to solve items \(a\), \(c\) and \(d\), then he or she is in the knowledge state \(\{a, c, d\}\).

In principle, \(2^{|Q|}\) potential knowledge states exist. However, the number of knowledge states that actually occur is reduced, since not all subsets of \(Q\) are plausible due to dependencies between the tasks:

  • Logical dependencies: the same solution steps that are required for a task also lead to the solution of another task.
  • Dependencies of the kind that if one (supposedly difficult) task is solved, it is practically certain that another (supposedly easier) task will also be solved (e.g. conveyed by the order of the corresponding content in the curriculum).

Formally, a knowledge structure is defined by the pair \((Q, \mathcal{K})\):

  • \(Q\): a domain of knowledge
  • \(\mathcal{K}\): the set of all plausible knowledge states \(K \in \mathcal{K}\), where \(\mathcal{K} \subseteq \mathcal{P}(Q)\) and at least \(\emptyset, Q \in \mathcal{K}\)
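As a small illustration (a hypothetical Python sketch, not part of the tutorial apps; the helper name `is_knowledge_structure` is made up), a knowledge structure can be represented as a family of frozensets over \(Q\):

```python
# Hypothetical sketch: represent a knowledge structure (Q, K) in Python.
Q = frozenset("abcdef")

# A plausible family of knowledge states; it must contain the empty set and Q.
K = {
    frozenset(),
    frozenset("a"),
    frozenset("ac"),
    frozenset("acd"),
    Q,
}

def is_knowledge_structure(Q, K):
    """Check the minimal requirements: every state is a subset of Q,
    and both the empty set and Q itself are states."""
    return (all(state <= Q for state in K)
            and frozenset() in K and Q in K)

print(is_knowledge_structure(Q, K))  # True
```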

For a formal description of the dependency relations between the tasks in a knowledge domain \(Q\), one can, in the simplest case, define a binary relation \(\preccurlyeq\) on \(Q\) such that \(p \preccurlyeq q\) if and only if a correct answer to task \(q\) allows one to surmise a correct answer to task \(p\).

The relation \(\preccurlyeq\) is called the precedence relation (or the surmise relation).

By the definition of the precedence relation \(\preccurlyeq\) on \(Q\), the following properties hold:

  • \(\forall p \in Q: p \preccurlyeq p\) (reflexive)
  • \(\forall p, q, r \in Q: p \preccurlyeq q \land q \preccurlyeq r \Rightarrow p \preccurlyeq r\) (transitive).

A precedence relation \(\preccurlyeq\) on \(Q\) can therefore be regarded as a quasi-order (a generalisation of the weak order and the partial order).
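The reflexive-transitive closure that turns an arbitrary relation into a quasi-order can be sketched in a few lines of Python (a hypothetical illustration; the function name is made up):

```python
# Hypothetical sketch: turn an arbitrary relation on Q into a quasi-order
# by adding the pairs required for reflexivity and transitivity
# (reflexive-transitive closure).
def quasi_order_closure(Q, relation):
    R = set(relation) | {(p, p) for p in Q}   # reflexivity
    changed = True
    while changed:                            # transitivity
        changed = False
        for (p, q) in list(R):
            for (q2, r) in list(R):
                if q == q2 and (p, r) not in R:
                    R.add((p, r))
                    changed = True
    return R

Q = {"a", "b", "c"}
closure = quasi_order_closure(Q, {("a", "b"), ("b", "c")})
print(sorted(closure))
# [('a', 'a'), ('a', 'b'), ('a', 'c'), ('b', 'b'), ('b', 'c'), ('c', 'c')]
```

Note that the pair ('a', 'c') was added by transitivity, exactly as the surmise relation app below adds missing elements.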

Properties of relations

Task:
Create a random relation and see what happens when you remove and add elements.
Which elements do you have to add to create a reflexive relation?

Your Relation


Properties of R


A Hasse diagram is only shown for quasi- and partial orders.

Hasse Diagram

This app lets you define a relation R on a set S = {a, b, c, d, e}. It shows certain properties of R and its Hasse diagram if R is a quasi- or partial order.

 

© 2020 Cord Hockemeyer, University of Graz, Austria

Surmise Relation

Task:
Enter a relation by checking the corresponding boxes. Look at the graph of the relation and compare it with the graph of the corresponding surmise relation next to it. There may be differences. What could be the reason for these differences?
(Hint: Think about the necessary properties of a surmise relation).

Enter your Prerequisite Relation!

XY: X is in relation to Y. E.g. 13 means that item 1 is a prerequisite for item 3.

Incidence Matrices

A red background of a cell indicates that the entered relation was not transitive or reflexive and that element(s) were added accordingly to create a transitive and reflexive surmise relation.

Your Relation
Corresponding Surmise Relation

A knowledge structure \((Q, \mathcal{K})\) that is closed with respect to set union is called a knowledge space.
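Closure under set union is easy to check mechanically. The following Python sketch (hypothetical, with a made-up helper name) tests the property on a toy structure:

```python
from itertools import combinations

# Hypothetical sketch: test whether a knowledge structure is a knowledge
# space, i.e. closed under union of any two states.
def is_union_closed(K):
    return all(a | b in K for a, b in combinations(K, 2))

K = {frozenset(), frozenset("a"), frozenset("b"), frozenset("ab")}
print(is_union_closed(K))  # True

# Removing {a, b} breaks closure: {a} | {b} is no longer a state.
print(is_union_closed(K - {frozenset("ab")}))  # False
```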

Assumptions about the underlying learning process are:

  • If a person is ready to learn an item, then a person who can solve additional items has already learned the item or is also ready to learn the item (alternatively, if a person is ready to learn an item, then he or she remains so).
    • This implies closure with respect to set union.
  • From each knowledge state, the solution of new items can be learned one at a time.
    • This implies a property that allows going from any knowledge state to any other in the knowledge structure in steps of one item (“well-graded”). Not every knowledge space is necessarily well-graded.

If the structure is closed with respect to set union and set intersection, then it is called a quasi-ordinal knowledge space.

For quasi-ordinal knowledge spaces there is a one-to-one correspondence between the knowledge structure and precedence relation.

Theorem of G. Birkhoff (1937)

  • There is a one-to-one correspondence between the quasi-orders \(\preccurlyeq\) on a set \(Q\) and the families of subsets of \(Q\) which are closed with respect to set union and set intersection (also known as quasi-ordinal knowledge spaces)
    • For a given relation \(\preccurlyeq\) on \(Q\), \(K \subseteq Q\) is a knowledge state in \(\mathcal{K}_\preccurlyeq\) if \[(q \in K \land p \preccurlyeq q) \Rightarrow p \in K \;\;\; \forall p, q \in Q\]
    • For a given knowledge structure \(\mathcal{K}\), a precedence relation \(\preccurlyeq_\mathcal{K}\) on \(Q\) is given by \[p \preccurlyeq_\mathcal{K} q \Leftrightarrow (q \in K \Rightarrow p \in K) \;\;\; \forall K \in \mathcal{K}\]
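Both directions of Birkhoff's correspondence can be sketched directly from the two formulas above (a hypothetical Python illustration; both function names are made up):

```python
from itertools import chain, combinations

# Hypothetical sketch of Birkhoff's correspondence:
# from a quasi-order to its quasi-ordinal knowledge space, and back.
def states_from_order(Q, order):
    """All subsets K of Q such that q in K and p <= q imply p in K."""
    items = sorted(Q)
    subsets = chain.from_iterable(combinations(items, r)
                                  for r in range(len(items) + 1))
    return {frozenset(s) for s in subsets
            if all(p in s for q in s for (p, q2) in order if q2 == q)}

def order_from_states(Q, K):
    """p <= q iff every state containing q also contains p."""
    return {(p, q) for p in Q for q in Q
            if all(p in state for state in K if q in state)}

Q = {"a", "b"}
order = {("a", "a"), ("b", "b"), ("a", "b")}  # a is a prerequisite of b
K = states_from_order(Q, order)
print(sorted(map(sorted, K)))            # [[], ['a'], ['a', 'b']]
print(order_from_states(Q, K) == order)  # True
```

Round-tripping the structure through both functions recovers the original quasi-order, as the theorem promises for quasi-ordinal knowledge spaces.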

Validating Knowledge Structures

Click on the information button (“i”) to get a description of the different fit indices.

How well does the knowledge structure fit to the data?



Choose a data set:


Choose a coefficient:





Patterns of responses not included in the diagram:

 

Distance Distribution

Please keep in mind that the maximal possible distance is
   dmax = ⌊|Q|/2⌋ = 2.
Please note that the surmise relation of a structure is always the surmise relation of the enclosing quasi-ordinal knowledge space, i.e. of the closure of the structure under union and intersection.

Deterministic Assessment

Click through the three tabs and follow the instructions. In the beginning it is advisable to use the default structure. In a second step you can adjust this structure and see how it affects the assessment.



In this application, you have two options:

  • using a knowledge structure observed from a previous sample, or

  • building your own knowledge structure.


  • Then, you can simulate being a new student by answering a quiz.

    After each of your answers, we will update the list of still eligible possible knowledge states.

    These are all the possible response patterns.

    As a default structure, we have checked the patterns which belong to the knowledge structure observed from a previous classroom sample. Both the empty set and the full item set Q have already been included.

    You can use the one available, or create your own.

    Don't forget to press the 'Done' button when you are finished.


    Please note that the empty set (marked by 0) and the full item set Q are always contained.




     
    In the 'quiz' tab, you can experience a deterministic adaptive assessment of your knowledge in our small probability domain.

    We start with the full knowledge structure. As soon as you answer questions, states are eliminated. In the Hasse diagram on the right, the eliminated and the still eligible knowledge states are shown in different colors.
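The elimination step can be sketched in a few lines of Python (a hypothetical illustration, not the app's actual code): a correct answer to item q rules out all states not containing q, and an incorrect answer rules out all states containing q.

```python
# Hypothetical sketch of deterministic assessment: filter the set of
# still-eligible knowledge states after each observed answer.
def eliminate(states, item, correct):
    return {K for K in states if (item in K) == correct}

states = {frozenset(), frozenset("a"), frozenset("ab"), frozenset("abc")}
states = eliminate(states, "a", correct=True)   # drops the empty set
states = eliminate(states, "c", correct=False)  # drops {a, b, c}
print(sorted(map(sorted, states)))  # [['a'], ['a', 'b']]
```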

    Please answer the following questions


    The Assessment is completed.

    Still eligible knowledge states





    Probabilistic KST

    Motivation

    • In practical applications we cannot assume that a student’s response to an item is correct if and only if the student masters it (i.e. if the item is an element of the respective knowledge state).
    • There are two types of response errors:
      • careless error, i.e. the response is incorrect although the item is contained in the knowledge state
      • lucky guess, i.e. the response is correct although the item is not contained in the knowledge state.
    • In any case, we need to dissociate knowledge states and response patterns.
      • \(R\) denotes a response pattern, which is a subset of \(Q\)
      • \(\mathcal{R} = 2^Q\) denotes the set of all possible response patterns
    • In this approach
      • the knowledge state \(K\) is a latent construct
      • the response pattern \(R\) is a manifest indicator of the knowledge state.
    • The conclusion from the observable response pattern to the unobservable knowledge state can only be of a stochastic nature.
    • This requires introducing a probabilistic framework.

    A probabilistic knowledge structure (PKS) is defined by specifying

    • a knowledge structure \(\mathcal{K}\) on a knowledge domain \(Q\) (i.e. a collection \(\mathcal{K} \subseteq 2^Q\) with \(\emptyset, Q \in \mathcal{K}\))
    • a (marginal) distribution \(P(K)\) on the knowledge states \(K \in \mathcal{K}\)
    • the conditional probabilities \(P(R | K)\) to observe response pattern \(R\) given knowledge state \(K\) \[P(R | K) = \frac{P(R, K)}{ P(K)}\]

    The probability of the response pattern \(R \in \mathcal{R} = 2^Q\) is predicted by (cf. law of total probability)

    \[P(R) = \sum\limits_{K \in \mathcal{K}} P(R | K) \cdot P(K)\]
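This mixture over states can be computed directly (a hypothetical Python sketch with made-up toy numbers; `pattern_probability` is not a function from the apps):

```python
# Hypothetical sketch: the predicted pattern probability as a mixture
# over knowledge states (law of total probability).
def pattern_probability(R, state_probs, conditional):
    """P(R) = sum over K of P(R | K) * P(K)."""
    return sum(conditional(R, K) * pK for K, pK in state_probs.items())

# Toy example: two states and a noiseless response rule (R = K exactly).
probs = {frozenset(): 0.4, frozenset("a"): 0.6}
noiseless = lambda R, K: 1.0 if R == K else 0.0
p = pattern_probability(frozenset("a"), probs, noiseless)
print(p)  # 0.6
```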

    Local Stochastic Independence

    Assumptions:

    • Given the knowledge state \(K\) of a person
      • the responses are stochastically independent over problems
      • the response to each problem \(q\) only depends on the probabilities
        • \(\beta_q\) of a careless error
        • \(\eta_q\) of a lucky guess
    • The probability of the response pattern \(R\) given the knowledge state \(K\) reads

    \[P(R | K) = \left(\prod\limits_{q \in K \setminus R} \beta_q \right) \cdot \left(\prod\limits_{q \in K \cap R} (1 - \beta_q) \right) \cdot \left(\prod\limits_{q \in R \setminus K} \eta_q \right) \cdot \left(\prod\limits_{q \in \overline{R} \cap \overline{K}} (1 - \eta_q) \right)\]

    A PKS satisfying these assumptions is called a basic local independence model (BLIM).
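The four-factor product above translates line by line into code (a hypothetical Python sketch; `blim_likelihood` is a made-up name):

```python
# Hypothetical sketch of the BLIM formula for P(R | K), with per-item
# careless-error rates beta[q] and lucky-guess rates eta[q].
def blim_likelihood(R, K, Q, beta, eta):
    p = 1.0
    for q in Q:
        if q in K and q not in R:    # careless error
            p *= beta[q]
        elif q in K and q in R:      # mastered and answered correctly
            p *= 1 - beta[q]
        elif q in R:                 # q not in K: lucky guess
            p *= eta[q]
        else:                        # q neither in K nor in R
            p *= 1 - eta[q]
    return p

Q = {"a", "b"}
beta = {"a": 0.1, "b": 0.1}
eta = {"a": 0.2, "b": 0.2}
# A person in state {a} who answers only item a correctly:
p = blim_likelihood(frozenset("a"), frozenset("a"), Q, beta, eta)
print(round(p, 2))  # (1 - beta_a) * (1 - eta_b) = 0.9 * 0.8 = 0.72
```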

    Simulating BLIM using predefined structures

    Task: Choose a structure and see how the simulated response patterns change when the error rates are modified.

    Example Spaces

    As example data, knowledge spaces provided by the R package pks (Heller & Wickelmaier, 2013; Wickelmaier et al., 2016) are used. Concretely, the following spaces are used:
    Density
    Taagepera et al. (1997) applied knowledge space theory to specific science problems. The density test was administered to 2060 students; a substructure of five items is included here.
    Matter
    Taagepera et al. (1997) applied knowledge space theory to specific science problems. The conservation of matter test was administered to 1620 students; a substructure of five items is included here.
    Doignon & Falmagne
    Fictitious data set from Doignon and Falmagne (1999, chap. 7).

    References

    Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge spaces. Berlin: Springer.

    Heller, J., & Wickelmaier, F. (2013). Minimum discrepancy estimation in probabilistic knowledge structures. Electronic Notes in Discrete Mathematics, 42, 49–56.

    Schrepp, M., Held, T., & Albert, D. (1999). Component-based construction of surmise relations for chess problems. In D. Albert & J. Lukas (Eds.), Knowledge spaces: Theories, empirical research, and applications (pp. 41–66). Mahwah, NJ: Erlbaum.

    Taagepera, M., Potter, F., Miller, G. E., & Lakshminarayan, K. (1997). Mapping students' thinking patterns by the use of knowledge space theory. International Journal of Science Education, 19, 283–302.

    Wickelmaier, F., Heller, J., & Anselmi, P. (2016). pks: Probabilistic Knowledge Structures. R package version 0.4-0. https://CRAN.R-project.org/package=pks

    About the Theory

    In Knowledge Space Theory, a knowledge structure 𝒦 is any family of subsets of a set Q of items (or test problems) which contains the empty set {} and the full item set Q. Such a knowledge structure can be used together with the BLIM model to simulate response patterns.

    The Basic Local Independence Model (BLIM)

    Assumption: Given the knowledge state \(K\) of a person, the responses are stochastically independent over problems, and the response to each problem \(q \in Q\) only depends on the probabilities \(\beta_q\) of a careless error and \(\eta_q\) of a lucky guess for item \(q\).

    In this app, we simplify the BLIM a bit further by assuming identical β and η values for all items.

    Simulating BLIM using items

    Follow the instructions and click through the tabs.

    Introduction

    This app demonstrates the simulation of response patterns with the BLIM model.

    The app works with a set of five items on elementary arithmetic following the knowledge space used as the standard example by Doignon & Falmagne (1999, Knowledge Spaces, chapter 7). As a first step, the user has to select a subset of at least three items. Then a BLIM simulation producing fictitious response patterns is run, and the frequencies of the response patterns are depicted in a histogram. The patterns are sorted according to the lexical order of their binary representation.
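A simulation of this kind, with a single β and η shared by all items as in the previous app, might look as follows (a hypothetical Python sketch, not the app's implementation; a uniform distribution over states is assumed for simplicity):

```python
import random

# Hypothetical sketch: simulate response patterns from a BLIM with one
# careless-error rate beta and one lucky-guess rate eta for all items.
def simulate_patterns(states, Q, beta, eta, n, seed=1):
    rng = random.Random(seed)
    states = list(states)
    patterns = []
    for _ in range(n):
        K = rng.choice(states)  # assumption: states are equally likely
        # Include q in the response pattern with prob 1-beta if mastered,
        # and with prob eta (lucky guess) otherwise.
        R = {q for q in Q
             if (rng.random() > beta if q in K else rng.random() < eta)}
        patterns.append(frozenset(R))
    return patterns

Q = frozenset("ab")
states = [frozenset(), frozenset("a"), Q]
patterns = simulate_patterns(states, Q, beta=0.1, eta=0.1, n=100)
print(len(patterns))  # 100
```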

    BLIM Simulation

    About this App

    This app is an adaptation of an app written by students at the TquanT Seminar 2017 in Deutschlandsberg, Austria.

    It was adapted by Cord Hockemeyer, University of Graz, Austria.

    © 2017, Cord Hockemeyer, University of Graz, Austria

    TquanT was co-funded by the Erasmus+ Programme of the European Commission.

    Maximum Likelihood Estimation

    Basics:

    • The data consist of a vector specifying for each subject in a sample of size \(N\) the given response pattern.
    • From this we can derive the absolute frequencies \(N_R\) of the patterns \(R \in \mathcal{R}\).
    • For a given knowledge structure \(\mathcal{K}\) the likelihood is given by \[\mathcal{L}(\beta, \eta; \pi | x) = \prod\limits_{R \in \mathcal{R}} P(R | \beta, \eta, \pi)^{N_R}\] where \(\beta = (\beta_q)_{q \in Q}\), \(\eta = (\eta_q)_{q \in Q}\), and \(\pi = (\pi_K)_{K \in \mathcal{K}}\) with \(\pi_K = P(K)\).

    Determining the maximum likelihood estimates (MLEs) requires computing the partial derivatives of the log-likelihood with respect to each of the parameters collected in the vectors \(\beta, \eta, \pi\).

    • The problems concerning the analytical tractability of this derivation arise from the fact that \(P(R| \mathcal{K}, \beta,\eta, \pi)\) actually is the sum \[P(R | \beta, \eta, \pi) = \sum\limits_{K \in \mathcal{K}} P(R, K | \beta, \eta, \pi)\]
    • Doignon & Falmagne (1999) thus resort to numerical optimization techniques.
    • A (partial) solution to this problem is provided by the so-called EM algorithm (Heller & Wickelmaier, 2013; Stefanutti & Robusto, 2009).

    EM algorithm

    • The EM algorithm is an iterative optimization method for providing MLEs of unknown parameters, which proceeds within a so-called incomplete-data framework.

    • Considering the given data as incomplete, and (artificially) extending them by including actually unobservable variables (‘unknown data’) often facilitates the computation of the MLEs considerably.

    • In the present context we assume that for each subject we observe both the given response pattern \(R\) and the respective knowledge state \(K\) (complete data), thus having available the absolute frequencies \(M_{RK}\) of subjects who are in state \(K\) and produce pattern \(R\).

    • The likelihood of the complete data then reads

    \[\mathcal{L}(\beta, \eta; \pi | x, y) = \prod\limits_{R \in \mathcal{R}} \prod\limits_{K \in \mathcal{K}} P(R, K | \beta, \eta, \pi)^{M_{RK}}\]

    • The terms containing \(\beta\), \(\eta\), and \(\pi\), respectively, can be maximized independently, and MLEs \(\beta^{(t)}, \eta^{(t)}\) and \(\pi^{(t)}\) are obtained.

    • In a second step the expected frequencies \(M_{RK}\) are calculated with the updated MLEs \(\beta^{(t)}, \eta^{(t)}\) and \(\pi^{(t)}\): \[\mathcal{E}(M_{RK}) = N_R \cdot P(K | R, \beta^{(t)}, \eta^{(t)}, \pi^{(t)})\]

    • By repeating these two steps, the MLEs converge and the final estimators are obtained.
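One such iteration can be sketched compactly (a hypothetical Python illustration with made-up toy data; a single β and η is assumed for brevity, and only the π update is shown since the β/η updates are analogous):

```python
# Hypothetical sketch of one EM iteration for the BLIM.
def p_r_given_k(R, K, Q, beta, eta):
    # P(R | K) with a shared careless-error rate beta and guess rate eta.
    p = 1.0
    for q in Q:
        if q in K:
            p *= (1 - beta) if q in R else beta
        else:
            p *= eta if q in R else (1 - eta)
    return p

def em_step(pattern_counts, states, Q, beta, eta, pi):
    # E-step: expected frequencies m[R][K] = N_R * P(K | R)
    m = {}
    for R, N_R in pattern_counts.items():
        joint = {K: p_r_given_k(R, K, Q, beta, eta) * pi[K] for K in states}
        total = sum(joint.values())
        m[R] = {K: N_R * joint[K] / total for K in states}
    # M-step (pi only): pi_K is the expected relative frequency of K
    N = sum(pattern_counts.values())
    return {K: sum(m[R][K] for R in pattern_counts) / N for K in states}

Q = {"a"}
states = [frozenset(), frozenset("a")]
counts = {frozenset(): 40, frozenset("a"): 60}  # made-up observed data
pi = {frozenset(): 0.5, frozenset("a"): 0.5}
pi = em_step(counts, states, Q, beta=0.1, eta=0.1, pi=pi)
print(round(pi[frozenset("a")], 2))  # 0.58
```

Iterating `em_step` until the estimates stop changing yields the converged MLEs described above.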

    Parameter Estimation

    Task: Read through the information on the different estimation methods. Then switch to the second tab.

    • First choose the absolute frequencies of different answer patterns.
    • Then decide in step 2 which quantities should be included as knowledge states in the structure.
      • (In a first step you can use the default values at step 1 and 2).
    • In step 3 then estimate and visualize the parameters with the different estimation methods.
    • Finally, at 4 and 5 you can output the expected frequencies of the response patterns and simulated response frequencies.

    The three estimation methods


    With a given knowledge structure and observed response patterns, there are three methods to estimate the parameters of a basic local independence model (BLIM):

    • Maximum Likelihood (ML) Estimation
    • Minimum Discrepancy (MD) Estimation
    • Minimum Discrepancy ML Estimation (MDML)

    • ML: estimates parameters that maximize the probability of the observed data
      • Pros: driven by the likelihood of the data; (approximately) unbiased estimates
      • Cons: iterative (EM algorithm); might inflate error rates for good fit
    • MD: assumes that any response pattern is generated by the knowledge state closest to it
      • Pros: computationally efficient (explicit estimators); avoids inflating the error rates
      • Cons: ignores the likelihood of the data; estimates not unbiased
    • MDML: ML estimation under certain MD restrictions
      • Pros: minimizes the expected number of response errors; maximizes the likelihood under this constraint; provides a reference for quantifying the amount of fit obtained by inflating error rates
      • Cons: estimates not unbiased

    For this example, we will be working with a knowledge domain of four items: Q = {a,b,c,d}


    1) Set the observed response frequencies of ...


    2) Set your knowledge structure 𝒦

    You have selected the following knowledge structure:

    3) Calculate your parameter estimates

    The error rate estimates for each item are:


    The estimates of the state probabilities are:

    4) Calculate expected frequencies with each parameter set


    5) Simulate response patterns with each parameter set

    Probabilistic Knowledge Assessment

    Click through the three tabs and follow the instructions. In the beginning it is advisable to use the default structure. In a second step you can adjust this structure and see how it affects the assessment.
    How is this different from deterministic assessment? Compare the “Quiz” tab with that of the deterministic app in the previous chapter.



    In this application, you have two options:

  • using a knowledge structure observed from a previous sample, or

  • building your own knowledge structure.


  • Then, you can simulate being a new student by answering a quiz.

    After each of your answers, we will update the probabilities of your possible knowledge states.

    These are all the possible response patterns.

    As a default structure, we have checked the patterns which belong to the knowledge structure observed from a previous classroom sample. Both the empty set and the full item set Q have already been included.

    You can use the one available, or create your own.

    Don't forget to press the 'Done' button when you are finished.


    Please note that the empty set (marked by 0) and the full item set Q are always contained.




     
    In this tab, you can experience an adaptive assessment of your knowledge in our small probability domain.

    We start with an equal probability distribution over the knowledge structure you have developed. After each of your answers, the probabilities are updated according to the Bayesian updating formula.

    On the left side, you see the question and answer possibilities, on the right side a Hasse diagram of your knowledge structure indicating the current probability distribution.

    Select the probabilities for careless errors and lucky guesses which influence the strength of the probability update.
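The Bayesian updating step behind this quiz can be sketched as follows (a hypothetical Python illustration, not the app's code; `bayes_update` is a made-up name): each state's probability is reweighted by the likelihood of the observed answer, then renormalized.

```python
# Hypothetical sketch of probabilistic assessment: Bayesian update of
# the state distribution after the answer to one item.
def bayes_update(pi, item, correct, beta, eta):
    def likelihood(K):
        if item in K:
            return 1 - beta if correct else beta
        return eta if correct else 1 - eta
    posterior = {K: likelihood(K) * p for K, p in pi.items()}
    total = sum(posterior.values())
    return {K: p / total for K, p in posterior.items()}

# Uniform prior over three states, then a correct answer to item "a":
pi = {frozenset(): 1/3, frozenset("a"): 1/3, frozenset("ab"): 1/3}
pi = bayes_update(pi, "a", correct=True, beta=0.1, eta=0.1)
print(round(pi[frozenset()], 3))  # 0.053: states containing "a" gain mass
```

Unlike the deterministic assessment, no state is ever eliminated outright; its probability merely shrinks.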

    Please answer the following questions


    The Assessment is completed.

    Your probabilities of knowledge states: P(K|R)





    References

    Doignon, J.-P., & Falmagne, J.-C. (1999). Knowledge spaces. Springer. https://doi.org/10.1007/978-3-642-58625-5
    Heller, J., & Wickelmaier, F. (2013). Minimum discrepancy estimation in probabilistic knowledge structures. Electronic Notes in Discrete Mathematics, 42, 49–56. https://doi.org/10.1016/j.endm.2013.05.145
    Stefanutti, L., & Robusto, E. (2009). Recovering a probabilistic knowledge structure by constraining its parameter space. Psychometrika, 74, 83–96. https://doi.org/10.1007/s11336-008-9095-7

    Quiz

    Congratulations

    You have finished the tutorial. If you want to learn more exciting concepts of KST, check out the pages of TquanT and QHELP. There you will find more student-developed apps related to KST.


    Knowledge Space Theory

    An interactive tutorial using Shiny apps developed as part of the TquanT and QHELP programs.
    Compiled and adapted by Julian Mollenhauer