Difference between revisions of "Hyejuj project abstract"

From Cohen Courses
Jump to navigationJump to search
Line 1: Line 1:
 
== What I plan to do ==
 
== What I plan to do ==
 
I propose a semantic tagger that provides high level concept information for phrases in clinical narrative texts. I am going to use clinical narrative documents written by Korean doctors. The high level concept information which will be annotated is below.
 
I propose a semantic tagger that provides high level concept information for phrases in clinical narrative texts. I am going to use clinical narrative documents written by Korean doctors. The high level concept information which will be annotated is below.
 
+
**Target Semantic Tag
== Motivation ==
 
Clinical documents are invaluable information. Semantic tagging on clinical documents will help doctors with a support for medical decision making or for quality assurance of medical treatment.
 
 
 
== Interesting point ==
 
The clinical documents are written in both Korean and English. Usually, English is used for the medical terminologies, and Korean is used for some general nouns and most verbs though there are many exceptions.
 
 
 
== Target Semantic Tag ==
 
 
* Symptom
 
* Symptom
 
* Diagnosis
 
* Diagnosis
Line 18: Line 11:
 
* Performance
 
* Performance
 
* Patient Result
 
* Patient Result
 +
 +
 +
== Motivation ==
 +
Clinical documents are invaluable information. Semantic tagging on clinical documents will help doctors with a support for medical decision making or for quality assurance of medical treatment.
 +
 +
== Interesting point ==
 +
The clinical documents are written in both Korean and English. Usually, English is used for the medical terminologies, and Korean is used for some general nouns and most verbs though there are many exceptions.
 +
  
 
== Dataset ==
 
== Dataset ==

Revision as of 23:02, 29 September 2010

What I plan to do

I propose a semantic tagger that provides high level concept information for phrases in clinical narrative texts. I am going to use clinical narrative documents written by Korean doctors. The high level concept information which will be annotated is below.

    • Target Semantic Tag
  • Symptom
  • Diagnosis
  • Test
  • Test Result
  • Treatment Plan
  • Treatment
  • Treatment Stop
  • Performance
  • Patient Result


Motivation

Clinical documents are invaluable information. Semantic tagging on clinical documents will help doctors with a support for medical decision making or for quality assurance of medical treatment.

Interesting point

The clinical documents are written in both Korean and English. Usually, English is used for the medical terminologies, and Korean is used for some general nouns and most verbs though there are many exceptions.


Dataset

I have 600 manually tagged clinical narrative documents. They have been tagged with Unifies Medical Language System (UMLS), Part-of-Speech (POS).

Evaluation

The performance of the system can be measured as the level of accuracy of annotation, and it can be calculated as the number of correct tags per the total number of tags.

Techniques that can be used to solve this problem

  • To use Conditional Random Field
  • To use UMLS, POS, abbreviation, clue words, and numerical information to produce higher level concept information.

What question to answer

Can we show good performance on high-level semantic tagging using CRF?

Team Member

Hyeju Jang

Related Experience

I have developed a semantic tagger using Hidden Markov Model (HMM) in 2006. At that time, the target semantic tags were "Symptom", "Therapy", and "Performance."

References