|
|
@@ -1,5 +1,13 @@
|
|
|
# Semantic-Split
|
|
|
|
|
|
+This is a fork of semantic_split to allow other spacy models.
|
|
|
+
|
|
|
+Install (comfortably) via by pip:
|
|
|
+
|
|
|
+(switch to virtualenv)
|
|
|
+pip install git+https://git.alexanderhuwiler.ch/alexander/semantic_split_MultiLang.git
|
|
|
+
|
|
|
+
|
|
|

|
|
|
|
|
|
A Python library to chunk/group your text based on semantic similarity - ideal for pre-processing data for Language Models or Vector Databases. Leverages [SentenceTransformers](https://github.com/UKPLab/sentence-transformers) and [spaCy](https://github.com/explosion/spaCy).
|