signboard optical character recognition - stanford university...signboard optical character...

1
Signboard Optical Character Recognition Isaac Wu, Hsiao-Chen Chang Department of Electrical Engineering, Stanford University Motivation System Pipeline Results Having the ability to recognize any store just by taking a picture of its signboard is a powerful asset for business reviews and ratings companies such as Yelp to incorporate into their mobile app. Phase 1: Training A B C E D Phase 2: Segmentation MSER Grayscale Detect MSER Regions Remove Non- Text Regions Create Bounding Boxes of Each Region Merge Boxes and Keep the Longest Morphology Grayscale Increase Contrast Adaptive Thresholding Morphological Opening Small Region Removal Region Labeling Remove Non- Text Regions Create Bounding Box Phase 3: Recognition A B C E D MCDONALDS OCR Restrict OCR Matching to English Letters Perform OCR Remove short length words Remove Spaces Success Rate: 86% # Testing Images: 113 # Correctly Determined: 97 Algorithm trainDatabase() if (SIFT with MSER has many matches) return result elseif (SIFT with Morphology has many matches) return result elseif (OCR with MSER seems valid) return result elseif (OCR with Morphology seems valid) return result else return null * Multi-scale approach SIFT Extract SIFT Descriptors Match Descriptors to Codes Top 5 Database Matches Perform SIFT Match with RANSAC Return the Most Matches Manually Segmented Images Extract SIFT Descriptors Construct K-Means Codebook Match Descriptors to Codes 89.7% 7.2% 3.1% Techniques Used MSER x SIFT MORPH x SIFT MSER x OCR MORPH xOCR

Upload: others

Post on 26-Jan-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

  • Signboard Optical Character RecognitionIsaac Wu, Hsiao-Chen Chang

    Department of Electrical Engineering, Stanford University

    Motivation System Pipeline

    Results

    Having the ability to recognize any store just by taking

    a picture of its signboard is a powerful asset for

    business reviews and ratings companies such as Yelp

    to incorporate into their mobile app.

    Phase 1: Training A BC

    E

    …D

    Phase 2:

    Segmentation

    MSER GrayscaleDetect MSER

    RegionsRemove Non-Text Regions

    Create Bounding Boxes of Each Region

    Merge Boxes and Keep the

    Longest

    Morphology GrayscaleIncrease Contrast

    Adaptive Thresholding

    Morphological Opening

    Small Region Removal

    Region Labeling

    Remove Non-Text Regions

    Create Bounding Box

    Phase 3: RecognitionA B

    CE

    …D

    MCDONALDS

    OCRRestrict OCR Matching to

    English LettersPerform OCR

    Remove short length words

    Remove Spaces

    Success Rate: 86%

    # Testing Images: 113

    # Correctly Determined: 97

    AlgorithmtrainDatabase()

    if (SIFT with MSER has many matches) return result

    elseif (SIFT with Morphology has many matches) return result

    elseif (OCR with MSER seems valid) return result

    elseif (OCR with Morphology seems valid) return result

    else return null

    * Multi-scale approach

    SIFTExtract SIFT Descriptors

    Match Descriptors

    to Codes

    Top 5 Database Matches

    Perform SIFT Match with

    RANSAC

    Return the Most

    Matches

    Manually Segmented Images

    Extract SIFT Descriptors

    Construct K-Means Codebook

    Match Descriptors to Codes

    89.7%

    7.2%3.1%

    Techniques Used

    MSER x SIFT

    MORPH x SIFT

    MSER x OCR

    MORPH xOCR