← Back to all repos

RFTokenizer

https://github.com/amir-zeldes/RFTokenizer

📊 Stats

⭐ Stars: 31

📝 Language: Lex

📝 Description: A character-wise tokenizer for morphologically rich languages

⭐ Star Growth (12 months)

🔬 Research Notes

Stats

  • ⭐ Stars: 31
  • 🍴 Forks: 8
  • 📝 Language: Lex
  • 📅 Created: 2018-03-05
  • 🔄 Updated: 2026-02-07
  • 🏷️ Latest Release: No releases
  • Description

    A character-wise tokenizer for morphologically rich languages

    Topics

    None

    Research Summary

    Key Features

  • Architecture

  • Use Cases

  • Assessment

  • Maturity:
  • Documentation:
  • Community:
  • Recommendation:
  • README Excerpt

    ```

    žée

    ```

    ---

    *Researched: 2026-03-27*

    Generated: 2026-03-28