Discussion: View Thread

Papers on Database Design and Automatic Coding of Printed Materials

  • 1.  Papers on Database Design and Automatic Coding of Printed Materials

    Posted 08-14-2007 18:46
    I wanted to draw your attention to two papers I recently put together that may be useful for you and your doctoral students. My research team developed a couple of methods that I wrote up so that other people could benefit from them.

    Automatic Coding of Printed Materials
    Abstract:      
    The paper presents a complete method for using automatic techniques to code printed text pages. It involves three automatic steps and one or two steps of manual corrections to obtain fully accurate results. We discovered that present-day consumer digital cameras are much better than high-end scanners to obtain pictures of printed pages quickly and without the wear and tear associated with scanners. We also found that high-end ($370) OCR software is much more cost-effective to achieve accurate text recognition and to process large amounts of data. We also describe how researchers can write a computer program for classifying automatically non-uniform data. We provide detailed instructions for each step in the automatic coding method so that other researchers can readily copy it.

    Download at:
    http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1001568


    Constructing Effective Longitudinal Databases on Your PC
    Abstract:     
    The paper presents a strategy for designing longitudinal databases with FileMaker. The approach facilitates efficiency in entering data and flexibility for constructing statistical analyses from the raw data. The key feature of the strategy is to define the basic unit of observation in the database in terms of an agent, an event, and a date. Given that programs such as FileMaker can easily sort data by agent and date, once you structure the data correctly you can construct well-ordered event histories for agents, even if the researcher may enter the data in an unordered fashion. By using events that happened to an agent at a particular time as the basic unit of observation, one maintains maximum flexibility to do statistical analysis that aggregate basic data in different ways.

    Download at:
    http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1003657



    Best wishes,
    *******************************************************************
    Johann Peter Murmann

    Associate Professor of Strategic Management
    Academic Director of the Executive Year at the AGSM
    Head of the School of Strategy and Entrepreneurship


    Australian School of Business, Level 5
    University of New South Wales

    Sydney NSW 2052
    Australia


    New Phone: +61 (0) 2 9385 9733  Fax: +61 (0)2 9313 7279  
    Web:   http://professor-murmann.net

    Working Papers: http://ssrn.com/author=375099

    My recent book:   http://knowledge-and-competitive-advantage.info/

    Assistant: Avis Wong   +61 (0) 2 9385 5641  
        avisw@agsm.edu.au
    *******************************************************************