vasundhara chakraborty, victoria chiu, and miklos vasarhelyi san francisco july 31, 2010
DESCRIPTION
Automatic Classification of Accounting Literature Nineteenth Annual Strategic and Emerging Technologies Workshop. Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010. Automatic Classification of Accounting Literature. Introduction and Background - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/1.jpg)
Automatic Classification of Accounting Literature
Nineteenth Annual Strategic and Emerging Technologies Workshop
Vasundhara Chakraborty, Victoria Chiu, and Miklos VasarhelyiSan Francisco July 31, 2010
![Page 2: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/2.jpg)
OUTLINE
2
Automatic Classification of Accounting Literature
Introduction and Background Motivation and Research Questions
Literature Review Classification of Accounting Research- The Manual Method Development of Automatic Classification Method
Methodology- A Two-Phase Experiment Phase I: Keywords Phase II: Full Abstract
Results and Analysis Treatment Mode of Reasoning Accounting Area
Conclusion and Implication
![Page 3: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/3.jpg)
Introduction (1/5)
• Purpose
This study explores the possibility of developing a methodology to classify accounting academic publication automatically.
• Research Questions Can we automate the classification process (of accounting literature)
by using the keywords from academic journal articles?
Can we automate the classification process (of accounting literature) by using the full abstracts from academic journal articles?
Do results vary depending on which elements we use to automate the literature classification process and to what extent do they differ?
3
Automatic Classification of Accounting Literature
![Page 4: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/4.jpg)
Introduction (2/5)
• Contribution
Extending the usefulness of automatic text classification method to publications, and
Seeking the possibility to improve the methodology applied in research that investigates the attributes and development of knowledge in accounting discipline.
4
Automatic Classification of Accounting Literature
![Page 5: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/5.jpg)
Introduction (3/5)
• Motivation
Literature taxonomization is a critical element for revealing the development and evolution of knowledge in disciplines (Brown et al. 1987, 1989, Vasarhelyi et al. 1988, Brinberg and Shields 1989, Meyer and Rigsby 2001, Heck and Jensen 2007).
Traditionally, the taxonomization process has been manually performed in this research area (Vasarhelyi et al. 1984, Brown et al. 1985, 1989, and Badua 2005).
5
Automatic Classification of Accounting Literature
![Page 6: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/6.jpg)
Introduction (4/5)
• Motivation (cont.)
The rapid growth of collections in online academic databases indicates that there is increasing difficulty for professionals to access information in a timely and efficient way (Nobata 1999).
Gangolly and Wu (2000)- the development of methods for automatic indexing and classification of concepts has been necessitated due to The increase of text databases size. High cost of domain expertise to develop classifications.
6
Automatic Classification of Accounting Literature
![Page 7: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/7.jpg)
Introduction (5/5)
7
• Taxonomy for Classifying Accounting Publications Accounting Research Directory- The Database of Accounting Literature
(ARD)- (Brown, Gardner and Vasarhelyi, 1993).Treatment, Accounting Area, Mode of Reasoning, Research Method, Inference
Style, Mode of Analysis, School of Thought, Information, Geography, Objective, Applicability, and Foundation Discipline.
Treatment: identifies the major factor (independent variable) or other accounting phenomenon associated with/causes the Information taxon (dependent variable).
Accounting Area: identifies the major accounting field under which the article belongs.
Mode of Reasoning: identifies the technique used to formally arrive at the conclusion in the article.
Automatic Classification of Accounting Literature
![Page 8: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/8.jpg)
Literature Review (1/4)
I. Accounting Literature Classification- The Manual Method
• Literature on examining the attributes and development of accounting research manually classifies publications that represent the core knowledge of accounting discipline. • Kinard and Putney 1968, Gonedes and Dopuch 1974, Hofstedt 1975,
1976, Brown et al. 1987, Vasarhelyi et al. 1988, Rigsby 2001.
• Vasarhelyi et al. (1988) examine the trend of accounting research within 1963 and 1984. • Taxonomy: Research Method, Foundation Discipline, School of
Thought, and Mode of Reasoning.
8
Automatic Classification of Accounting Literature
![Page 9: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/9.jpg)
Literature Review (2/4)
I. Accounting Literature Classification- The Manual Method (cont.)
• Brown et al. (1989) researched on accounting publications in academic journals (AOS, TAR, JAE, and JAR) from 1976 to 1984. Accounting Area, Research Method, School of Thought, and
Geographic.
• Fleming et al. (2000) studied the evolution of research in The Accounting Review (TAR) within 1966 and 1985. Focused attributes: Research Methods, Financial Accounting
subtopics, article length, citations, and author background.
9
Automatic Classification of Accounting Literature
![Page 10: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/10.jpg)
Literature Review (3/4)
II. Development of Automatic Classification Method
• Crouch and Yang (1992) found that automatic classification method produces useful thesaurus classes when supplementing query terms.
• Chen et al. (1995) automatically generated a thesaurus to evaluate Worm Community System (WCS) by adopting the algorithmic approach developed by Chen and Lynch (1992) which was applied to generate a concept network.
• Nobata (1999) used statistical and decision tree methods to identify and classify biology terms automatically.
Refining the applied algorithms for automating classification process is needed.
10
Automatic Classification of Accounting Literature
![Page 11: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/11.jpg)
Literature Review (4/4)
II. Development of Automatic Classification Method (cont.)
• Classifying financial accounting concepts automatically (Gangolly 2000).
Term frequency in financial accounting standards was analyzed and clusters of concepts are derived by agglomerative nesting algorithm.
• Automatic grouping related accounting concepts (Garnsey 2006) Semantic parsing techniques and statistical methods were used.
11
Automatic Classification of Accounting Literature
![Page 12: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/12.jpg)
• Sample Collection Three hundred and fifty eight articles published in accounting journals
were downloaded.
Methodology (1/3)
Automatic Classification of Accounting Literature
![Page 13: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/13.jpg)
Word count Attribute Selection
Create document –term matrix
Apply classification algorithms
Word countAttribute Selection
Create document –term matrix
Apply classification algorithms
Phase II: Using Full Abstract
Term frequency
Create phrases
Term frequency
Journal articles Parse out keywords
Phase I: Using Keywords
Methodology (2/3)
Database
Validation of results
Validation of results
Parse out full abstractJournal articles
Database
Automatic Classification of Accounting Literature
![Page 14: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/14.jpg)
Examples of Keywords used for Treatment Taxon Classification
Methodology (3/3)
Automatic Classification of Accounting Literature
![Page 15: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/15.jpg)
Phase I: Keywords Analysis with all Subclasses
Results and Analysis (1/5)
Analysis on Treatment Taxon
Automatic Classification of Accounting Literature
![Page 16: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/16.jpg)
Phase I: Keywords Analysis with Class Modification
Results and Analysis (2/5)
Analysis on Treatment Taxon
Automatic Classification of Accounting Literature
![Page 17: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/17.jpg)
Phase II: Full Abstracts Analysis with Class Modification
Results and Analysis (3/5)
Analysis on Treatment Taxon
Automatic Classification of Accounting Literature
![Page 18: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/18.jpg)
Results and Analysis (4/5) Analysis on Accounting Area Taxon
Automatic Classification of Accounting Literature
Automatic Classification of Accounting Literature
![Page 19: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/19.jpg)
Results and Analysis (5/5) Analysis on Mode of Reasoning Taxon
Automatic Classification of Accounting Literature
Automatic Classification of Accounting Literature
![Page 20: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/20.jpg)
Comparison of Results
Conclusion (1/2)
Automatic Classification of Accounting Literature
![Page 21: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/21.jpg)
Conclusion (2/2)• Findings Summary
This study shows that using semantic parsing and data mining techniques, we can classify academic publications.
Treatment and Accounting Area taxons can be classified relatively better.
• Limitations Limited number of articles were used for the experiments. A more comprehensive database with sufficient representation of
articles belonging to different subclasses is needed.• Future Research
Use of a larger data corpus. Use of full text. Use semantic parsing and data mining methods to discover new
emerging paradigms in the accounting literature.
21
Automatic Classification of Accounting Literature
![Page 22: Vasundhara Chakraborty, Victoria Chiu, and Miklos Vasarhelyi San Francisco July 31, 2010](https://reader035.vdocuments.us/reader035/viewer/2022062323/56816585550346895dd83421/html5/thumbnails/22.jpg)
Thank You!
22
Automatic Classification of Accounting Literature