fenn.datasets

class fenn.datasets.TextDataset(X, y, tokenizer, max_length=1024)[source]

Bases: Dataset

Generic text + binary label dataset. X: list[str] y: list[int|float]

Parameters:
  • X (Sequence[str])

  • y (Sequence[int | float] | None)

  • max_length (int)

__init__(X, y, tokenizer, max_length=1024)[source]
Parameters:
  • X (Sequence[str])

  • y (Sequence[int | float] | None)

  • max_length (int)