--- license: mit language: - en tags: - lstm - text-segmentation - lightweight - client-side - web - onnxruntime-web - speech-to-text - low-memory-footprint --- The sentence splitter model is modification of the LSTM model with around 500 B input size taken from the repository: [NNSplit](https://github.com/kornelski/nnsplit) The size of the model used here is **~4 MB**. | | NNSplit | Spacy (Tagger) | Spacy (Sentencizer) | |------------------------|------------|----------------|---------------------| | Clean | 0.754371 | 0.853603 | 0.820934 | | Partial punctuation | 0.485907 | 0.517829 | 0.249753 | | Partial case | 0.761754 | 0.825119 | 0.819679 | | Partial punctuation and case | 0.443704 | 0.458619 | 0.249873 | | No punctuation and case| 0.166273 | 0.180859 | 0.00463281 | ### Example No punctuation and no cases (~17% accuracy)
**Input:** ```text the difference between rest and graphql is explained as follows rest is an architectural style that exposes resources via endpoints typically following crud operations each endpoint returns a fixed data structure graphql on the other hand allows clients to specify exactly what data they need in a single query often reducing overfetching and underfetching issues ``` **Result: 28.90ms ✅** 1. the difference between rest and graphql is explained as follows 2. rest is an architectural style that exposes resources via endpoints typically following crud operations 3. each endpoint returns a fixed data structure 4. graphql on the other hand allows clients to specify exactly what data they need in a single query often reducing overfetching and underfetching issues