Issues reproducing Precision/Recall/F1/F2 on the i2b2 dataset #11

Open
soulaven opened this issue Feb 19, 2021 · 1 comment

soulaven commented Feb 19, 2021

Hi,

Thank you for developing and releasing this package. I followed steps 0, 2a, 1b, and 1c using the PHI config file, and then ran 2d with prod=True. To calculate the scores, following my understanding of the paper, I split all PHI text at the word level and stripped edge-case punctuation such as "," and "." from the ends of words (otherwise the stats are much lower). However, I was only able to achieve Precision 0.696, Recall 0.915, F1 0.791, and F2 0.861 on the test set, which is some way off the statistics reported for the i2b2 test set in the paper. I am most likely missing something, but I am unsure what it is.
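For anyone comparing numbers, here is a minimal sketch of the word-level scoring described above. It is not the package's or the paper's evaluation code. The function names (`to_words`, `f_beta`, `word_level_scores`) are hypothetical, and it assumes PHI words are matched as a multiset without character offsets, which may differ from how the paper counts matches.

```python
from collections import Counter
import string


def to_words(spans):
    """Split PHI spans into words and strip punctuation from word edges."""
    words = []
    for span in spans:
        for token in span.split():
            token = token.strip(string.punctuation)  # "Smith," -> "Smith"
            if token:
                words.append(token)
    return Counter(words)


def f_beta(precision, recall, beta):
    """Standard F-beta: (1 + b^2) * P * R / (b^2 * P + R)."""
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


def word_level_scores(gold_spans, pred_spans):
    """Return (precision, recall, F1, F2) over word multisets.

    Assumption: words are compared without position information.
    """
    gold, pred = to_words(gold_spans), to_words(pred_spans)
    tp = sum((gold & pred).values())  # words present in both
    fp = sum(pred.values()) - tp      # predicted but not in gold
    fn = sum(gold.values()) - tp      # gold words that were missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall, f_beta(precision, recall, 1), f_beta(precision, recall, 2)


# Sanity check against the figures reported in this issue.
print(round(f_beta(0.696, 0.915, 1), 3), round(f_beta(0.696, 0.915, 2), 3))
```

The reported F1 and F2 are consistent with the reported precision and recall under the standard F-beta formula, so the gap from the paper would have to come from the precision and recall themselves (how true and false positives are counted), not from the F-score arithmetic.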

RedChrists (Collaborator) commented

In addition to step 0, a manual review of the results may be necessary to confirm that the missed i2b2 tags are actual PHI according to HIPAA.
