Chart Semirings¶

Semiring abstractions for parameterizing chart parsing (Goodman, 1999). Different semirings yield different parsing algorithms from the same CKY skeleton: marginal log-probability, Viterbi best-parse, boolean recognition, or derivation counting.

semiring ¶

Semiring abstractions for parameterizing chart parsing.

A semiring (S, ⊕, ⊗, 0, 1) provides the algebraic structure for chart parsing (Goodman, 1999). Different semirings yield different parsing algorithms from the same CKY skeleton:

LogProbSemiring — marginal log-probability (logsumexp / +)
ViterbiSemiring — best-derivation log-probability (max / +)
BooleanSemiring — recognition (or / and)
CountingSemiring — derivation counting (+ / ×)

Each semiring defines:

times(a, b) — multiplicative combination (⊗)
plus(scores, dim) — additive aggregation (⊕) over a dimension
zero — additive identity (0): score for impossible derivations
one — multiplicative identity (1): score for trivial derivations

Categorical perspective

Semirings are the decategorification of rig categories (bimonoidal categories). The times operation corresponds to the tensor product, plus to the coproduct. Chart parsing is a functor from the free monoidal category (grammar) to the semiring.

The connection to quantales: a quantale is a complete lattice with an associative binary operation distributing over arbitrary joins. The log-probability semiring is the quantale ([-∞, 0], logsumexp, +).

ChartSemiring ¶

Bases: ABC

Abstract semiring for chart parsing.

Subclasses implement the four semiring operations. All operations must be differentiable (where applicable) to support gradient-based learning.

zero `abstractmethod` `property` ¶

zero: float

Additive identity: the score of an impossible derivation.

one `abstractmethod` `property` ¶

one: float

Multiplicative identity: the score of a trivial derivation.

times `abstractmethod` ¶

times(a: Tensor, b: Tensor) -> Tensor

Multiplicative combination (⊗).

Combines scores from two chart cells being joined by a rule.

PARAMETER	DESCRIPTION
`a`	Left operand scores. TYPE: `Tensor`
`b`	Right operand scores. TYPE: `Tensor`

RETURNS	DESCRIPTION
`Tensor`	Combined scores.

Source code in src/quivers/stochastic/semiring.py

@abstractmethod
def times(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    """Multiplicative combination (⊗).

    Combines scores from two chart cells being joined by a rule.

    Parameters
    ----------
    a : torch.Tensor
        Left operand scores.
    b : torch.Tensor
        Right operand scores.

    Returns
    -------
    torch.Tensor
        Combined scores.
    """

plus `abstractmethod` ¶

plus(scores: Tensor, dim: int) -> Tensor

Additive aggregation (⊕) over a dimension.

Marginalizes over multiple derivations of the same span.

PARAMETER	DESCRIPTION
`scores`	Scores to aggregate. TYPE: `Tensor`
`dim`	Dimension to aggregate over. TYPE: `int`

RETURNS	DESCRIPTION
`Tensor`	Aggregated scores.

Source code in src/quivers/stochastic/semiring.py

@abstractmethod
def plus(self, scores: torch.Tensor, dim: int) -> torch.Tensor:
    """Additive aggregation (⊕) over a dimension.

    Marginalizes over multiple derivations of the same span.

    Parameters
    ----------
    scores : torch.Tensor
        Scores to aggregate.
    dim : int
        Dimension to aggregate over.

    Returns
    -------
    torch.Tensor
        Aggregated scores.
    """

plus_pair `abstractmethod` ¶

plus_pair(a: Tensor, b: Tensor) -> Tensor

Binary additive combination (⊕) of two tensors.

Used for combining a cell with unary rule contributions.

PARAMETER	DESCRIPTION
`a`	First operand. TYPE: `Tensor`
`b`	Second operand. TYPE: `Tensor`

RETURNS	DESCRIPTION
`Tensor`	Combined scores.

Source code in src/quivers/stochastic/semiring.py

@abstractmethod
def plus_pair(self, a: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    """Binary additive combination (⊕) of two tensors.

    Used for combining a cell with unary rule contributions.

    Parameters
    ----------
    a : torch.Tensor
        First operand.
    b : torch.Tensor
        Second operand.

    Returns
    -------
    torch.Tensor
        Combined scores.
    """

LogProbSemiring ¶

Bases: ChartSemiring

Log-probability semiring: (logsumexp, +, -∞, 0).

This is the default semiring for marginal parsing. times is addition in log-space (= multiplication of probabilities) and plus is logsumexp (= addition of probabilities in log-space).

ViterbiSemiring ¶

Bases: ChartSemiring

Viterbi semiring: (max, +, -∞, 0).

Finds the single best derivation. plus takes the max over derivations rather than summing. Useful for Viterbi parsing and best-first decoding.

BooleanSemiring ¶

Bases: ChartSemiring

Boolean semiring: (or, and, 0, 1).

For recognition only: determines whether a parse exists without computing scores. Values are 0.0 (false) and 1.0 (true).

CountingSemiring ¶

Bases: ChartSemiring

Counting semiring: (+, ×, 0, 1).

Counts the number of derivations. plus sums counts and times multiplies them (product rule for independent choices).

Note: Not differentiable in a useful sense for gradient-based learning, but useful for analysis.

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search

Chart Semirings¶

semiring ¶

ChartSemiring ¶

zero abstractmethod property ¶

one abstractmethod property ¶

times abstractmethod ¶

plus abstractmethod ¶

plus_pair abstractmethod ¶

LogProbSemiring ¶

ViterbiSemiring ¶

BooleanSemiring ¶

CountingSemiring ¶

zero `abstractmethod` `property` ¶

one `abstractmethod` `property` ¶

times `abstractmethod` ¶

plus `abstractmethod` ¶

plus_pair `abstractmethod` ¶