Skip to content
This repository was archived by the owner on Mar 28, 2026. It is now read-only.

simplify Python API by removing intermediate classes#96

Merged
mosuka merged 1 commit into
mainfrom
deps
Sep 14, 2025
Merged

simplify Python API by removing intermediate classes#96
mosuka merged 1 commit into
mainfrom
deps

Conversation

@mosuka

@mosuka mosuka commented Sep 14, 2025

Copy link
Copy Markdown
Member
  • Remove PyToken class and use dict for token representation
  • Remove PySegmenter class, integrate functionality into Tokenizer
  • Change Tokenizer initialization to accept direct parameters instead of Segmenter
  • Update all examples and tests to use new simplified API
  • Bump lindera dependency to 1.2.0

BREAKING CHANGE: Tokenizer now takes dictionary as first argument,
mode and user_dictionary as optional parameters. Token objects are
now returned as dictionaries with "surface" key instead of objects
with .text attribute.

@mosuka mosuka merged commit b6d3013 into main Sep 14, 2025
5 checks passed
@mosuka mosuka deleted the deps branch September 14, 2025 14:14
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant