cbdb
/

OfficeTitleAddressSplitter

Token Classification

中国古代官职地名拆分

Model card Files Files and versions

cbdb commited on Feb 2, 2024

Commit

0366aed

·

verified ·

1 Parent(s): ca2f00d

Update sample txt file

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -16,6 +16,10 @@ license: cc-by-nc-sa-4.0
 Our model <font color="cornflowerblue">OTAS (Office Title Address Splitter) </font> is a Named Entity Recognition Classical Chinese language model that is intended to  <font color="IndianRed">split the address portion in Classical Chinese office titles.</font>. This model is first inherited from raynardj/classical-chinese-punctuation-guwen-biaodian Classical Chinese punctuation model, and finetuned using over a 25,000 high-quality punctuation pairs collected CBDB group (China Biographical Database).
 ### <font color="IndianRed"> How to use </font>
 Here is how to use this model to get the features of a given text in PyTorch:

 Our model <font color="cornflowerblue">OTAS (Office Title Address Splitter) </font> is a Named Entity Recognition Classical Chinese language model that is intended to  <font color="IndianRed">split the address portion in Classical Chinese office titles.</font>. This model is first inherited from raynardj/classical-chinese-punctuation-guwen-biaodian Classical Chinese punctuation model, and finetuned using over a 25,000 high-quality punctuation pairs collected CBDB group (China Biographical Database).
+### <font color="IndianRed"> Sample input txt file </font>
+The sample input txt file can be downloaded here:
+https://huggingface.co/cbdb/OfficeTitleAddressSplitter/blob/main/vocab.txt
 ### <font color="IndianRed"> How to use </font>
 Here is how to use this model to get the features of a given text in PyTorch: