Identifying Layout Classes for Mathematical Symbols Using Layout Context

dc.contributor.author Zanibbi, Richard
dc.contributor.author Ouyang, Ling
dc.date.accessioned 2009-10-29T19:25:08Z
dc.date.available 2009-10-29T19:25:08Z
dc.date.issued 2009
dc.description.abstract We describe a symbol classification technique for identifying the expected locations of neighboring symbols in mathematical expressions. We use the seven symbol layout classes of the DRACULAE math notation parser (Zanibbi, Blostein, and Cordy, 2002) to represent these expected locations: Ascender, Descender, Centered, Open Bracket, Non-Script, Variable Range (e.g. integrals), and Square Root. A new feature based on shape contexts (Belongie et al., 2002), named layout context, describes the arrangement of neighboring symbol bounding boxes relative to a reference symbol, and the nearest neighbor rule is used for classification. Our experiments use 1917 mathematical symbols from the University of Washington III document database. Using a leave-one-out estimate, our best classification rate reaches nearly 80%. We find that the size of the symbol neighborhood and the number and arrangement of the key points representing a symbol affect performance significantly.
dc.language.iso en_US
dc.publisher IEEE Western New York Image Processing Workshop
dc.subject shape contexts en_US
dc.subject document layout analysis en_US
dc.subject character recognition en_US
dc.subject math recognition en_US
dc.title Identifying Layout Classes for Mathematical Symbols Using Layout Context
dc.type Article
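
The abstract describes a shape-context-style feature computed from neighboring bounding boxes, classified with the nearest neighbor rule and evaluated by leave-one-out. The sketch below illustrates that general idea only and is not the authors' implementation. The choice of five key points per box, the uniform (rather than log-polar) rings, the neighborhood radius, the chi-squared distance, and every function name and toy label are assumptions made for illustration.

```python
import math

def key_points(box):
    """Assumed key points of a box (x0, y0, x1, y1): four corners plus center."""
    x0, y0, x1, y1 = box
    return [(x0, y0), (x1, y0), (x0, y1), (x1, y1),
            ((x0 + x1) / 2, (y0 + y1) / 2)]

def layout_context(ref_box, neighbor_boxes, radius_scale=3.0,
                   n_rings=3, n_sectors=8):
    """Polar histogram of neighbor key points around the reference box center."""
    x0, y0, x1, y1 = ref_box
    cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
    # Assumption: neighborhood radius proportional to the reference symbol size.
    radius = radius_scale * max(x1 - x0, y1 - y0)
    hist = [0.0] * (n_rings * n_sectors)
    for box in neighbor_boxes:
        for px, py in key_points(box):
            d = math.hypot(px - cx, py - cy)
            if d == 0 or d >= radius:
                continue  # ignore points at the center or outside the neighborhood
            ring = min(int(n_rings * d / radius), n_rings - 1)
            angle = math.atan2(py - cy, px - cx) % (2 * math.pi)
            sector = min(int(n_sectors * angle / (2 * math.pi)), n_sectors - 1)
            hist[ring * n_sectors + sector] += 1
    total = sum(hist)
    # Normalize so symbols with different neighbor counts are comparable.
    return [h / total for h in hist] if total else hist

def chi2(a, b):
    """Chi-squared histogram distance (an assumed choice of metric)."""
    return 0.5 * sum((p - q) ** 2 / (p + q) for p, q in zip(a, b) if p + q > 0)

def leave_one_out_accuracy(features, labels):
    """1-nearest-neighbor accuracy, holding out each sample in turn."""
    correct = 0
    for i, f in enumerate(features):
        others = (j for j in range(len(features)) if j != i)
        nearest = min(others, key=lambda j: chi2(f, features[j]))
        correct += labels[nearest] == labels[i]
    return correct / len(features)

# Toy data in image coordinates (y grows downward); labels are placeholders.
ref = (0, 0, 10, 10)
samples = [
    ([(12, 0, 20, 10)], "class_a"),   # neighbor directly to the right
    ([(13, 0, 21, 10)], "class_a"),
    ([(12, -8, 16, -2)], "class_b"),  # smaller neighbor up and to the right
    ([(13, -8, 17, -2)], "class_b"),
]
features = [layout_context(ref, boxes) for boxes, _ in samples]
labels = [label for _, label in samples]
print(leave_one_out_accuracy(features, labels))
```

Changing radius_scale, n_rings, n_sectors, or the set of key points in this sketch corresponds loosely to the factors the abstract reports as significant: the neighborhood size and the number and arrangement of key points.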

Files in this item

Files Size Format
WNYIP_2009_Ouyang.pdf 491.6Kb PDF
