Token Classification
Transformers
Safetensors
Chinese
bert
Seq2SeqLM
古文
文言文
中国古代官职地名拆分
ancient
classical
Instructions to use cbdb/OfficeTitleAddressSplitter with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use cbdb/OfficeTitleAddressSplitter with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("token-classification", model="cbdb/OfficeTitleAddressSplitter")# Load model directly from transformers import AutoTokenizer, AutoModelForTokenClassification tokenizer = AutoTokenizer.from_pretrained("cbdb/OfficeTitleAddressSplitter") model = AutoModelForTokenClassification.from_pretrained("cbdb/OfficeTitleAddressSplitter") - Notebooks
- Google Colab
- Kaggle
Update sample txt file
Browse files
README.md
CHANGED
|
@@ -16,6 +16,10 @@ license: cc-by-nc-sa-4.0
|
|
| 16 |
|
| 17 |
Our model <font color="cornflowerblue">OTAS (Office Title Address Splitter) </font> is a Named Entity Recognition Classical Chinese language model that is intended to <font color="IndianRed">split the address portion in Classical Chinese office titles.</font>. This model is first inherited from raynardj/classical-chinese-punctuation-guwen-biaodian Classical Chinese punctuation model, and finetuned using over a 25,000 high-quality punctuation pairs collected CBDB group (China Biographical Database).
|
| 18 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
### <font color="IndianRed"> How to use </font>
|
| 20 |
|
| 21 |
Here is how to use this model to get the features of a given text in PyTorch:
|
|
|
|
| 16 |
|
| 17 |
Our model <font color="cornflowerblue">OTAS (Office Title Address Splitter) </font> is a Named Entity Recognition Classical Chinese language model that is intended to <font color="IndianRed">split the address portion in Classical Chinese office titles.</font>. This model is first inherited from raynardj/classical-chinese-punctuation-guwen-biaodian Classical Chinese punctuation model, and finetuned using over a 25,000 high-quality punctuation pairs collected CBDB group (China Biographical Database).
|
| 18 |
|
| 19 |
+
### <font color="IndianRed"> Sample input txt file </font>
|
| 20 |
+
The sample input txt file can be downloaded here:
|
| 21 |
+
https://huggingface.co/cbdb/OfficeTitleAddressSplitter/blob/main/vocab.txt
|
| 22 |
+
|
| 23 |
### <font color="IndianRed"> How to use </font>
|
| 24 |
|
| 25 |
Here is how to use this model to get the features of a given text in PyTorch:
|