All Projects → thunlp → Attribute_charge

thunlp / Attribute_charge

The source code of our COLING'18 paper "Few-Shot Charge Prediction with Discriminative Legal Attributes".

Programming Languages

python
139335 projects - #7 most used programming language

Few-Shot Charge Prediction with Discriminative Legal Attributes

Source code and datasets of COLING2018 paper: "Few-Shot Charge Prediction with Discriminative Legal Attributes". (pdf)

Dataset

Please download the dataset here, unzip it and you will get three folders: "data", "data_20w", "data_38w". Then put the folder "data" under this directory. It contains following files:

  • words.vec: Pre-trained word embeddings, each line contains a word and its embedding.
  • attributes: The legal attributes for each charge.
  • train: data for training from small dataset.
  • test: data for test from small dataset.
  • valid: data for validation from small dataset.

If you want to train and test on middle dataset, please copy the files in "data_20w" folder to "data" folder. If you want to train and test on large dataset, please copy the files in "data_38w" folder to "data" folder.

Run

Run the following command for training our model:

cd code/
python train.py

Dependencies

  • Tensorflow == 0.12
  • Scipy == 0.18.1
  • Numpy == 1.11.2
  • Python == 2.7

Log

After start training, a new folder "log" will be created.There are 4 directories in it:

  • /evaluation_charge_log/: stores model's performance of charge prediction on test data during training.
  • /evaluation_attr_log/: stores model's performance of attribute prediction on test data during training.
  • /validation_charge_log/: stores model's performance of charge prediction on validation data during training.
  • /validation_attr_log/: stores model's performance of attribute prediction on validation data during training.

Cite

If you use the code, please cite this paper:

Zikun Hu, Xiang Li, Cunchao Tu, Zhiyuan Liu, Maosong Sun. Few-Shot Charge Prediction with Discriminative Legal Attributes. The 27th Iinternational Conference on Computational Liguisitics (COLING 2018).

For more related works, please refer to my homepage.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].