Vicomtech at MEDDOCAN: Medical Document Anonymization

Egileak: Naiara Perez Miguel Laura García Sardiña Manex Serras Saenz Arantza del Pozo Echezarreta

Data: 01.08.2019


Abstract

This paper describes the participation of Vicomtech s team in the MEDDOCAN: Medical Document Anonymization challenge, which consisted in the recognition and classification of protected health information (PHI) in medical documents in Spanish. We tested different state-of-the-art classification algorithms, both deep and shallow, and rich sets of features, obtaining an F1-score of 0.960 in the strictest evaluation. The models submitted and scripts for decoding will be available at https://snlt.vicomtech.org/meddocan2019.

BIB_text

@Article {
title = {Vicomtech at MEDDOCAN: Medical Document Anonymization},
pages = {696-703},
keywds = {
PHI De-identification Textual Anonymisation Machine Learning Spanish Corpus
}
abstract = {

This paper describes the participation of Vicomtech s team in the MEDDOCAN: Medical Document Anonymization challenge, which consisted in the recognition and classification of protected health information (PHI) in medical documents in Spanish. We tested different state-of-the-art classification algorithms, both deep and shallow, and rich sets of features, obtaining an F1-score of 0.960 in the strictest evaluation. The models submitted and scripts for decoding will be available at https://snlt.vicomtech.org/meddocan2019.


}
date = {2019-08-01},
}
Vicomtech

Gipuzkoako Zientzia eta Teknologia Parkea,
Mikeletegi Pasealekua 57,
20009 Donostia / San Sebastián (Espainia)

+(34) 943 309 230

Zorrotzaurreko Erribera 2, Deusto,
48014 Bilbo (Espainia)

close overlay

Jokaeraren araberako publizitateko cookieak beharrezkoak dira eduki hau kargatzeko

Onartu jokaeraren araberako publizitateko cookieak