lecture #3 research methodology 1. 2 understand what is research define research problem how to...

Click here to load reader

Upload: annabel-marsh

Post on 23-Dec-2015

215 views

Category:

Documents


1 download

TRANSCRIPT

  • Slide 1
  • Lecture #3 Research Methodology 1
  • Slide 2
  • 2 Understand What is Research Define Research Problem How to solve it? -Research methods -Experiment Design How to write and publish an IT paper? Roadmap
  • Slide 3
  • How to solve the problem? Understanding the problem Distinguishing the unknown, the data and the condition Devising a plan Connecting the data to the unknown, finding related problems, relying on previous findings Carrying out the plan Validating each step, (if possible) proving correctness Looking back Checking the results, contemplating alternative solutions, exploring further potential of result/method 3
  • Slide 4
  • Research Methods Vs Methodology Research Methods are the methods by which you conduct research into a subject or a topic. involve conduct of experiments, tests, surveys,. Research Methodology is the way in which research problems are solved systematically. It is the Science of studying how research is conducted Scientifically involves the learning of the various techniques that can be used in the conduct of research and in the conduct of tests, experiments, surveys and critical studies. 4 http://www.differencebetween.com/difference-between-research-methods-and-vs-research- methodology/
  • Slide 5
  • Research Approaches Quantitative Approach (Uses experimental, inferential and simulation approaches to research) Qualitative Approach (Uses techniques like in-depth interview, focus group interviews) 5 Shashikant S Kulkarni,Research Methodology An Introduction
  • Slide 6
  • Types of Research in general Descriptive Analytical Applied Fundamental Quantitative Qualitative Conceptual Empirical Other Types 6 Shashikant S Kulkarni,Research Methodology An Introduction
  • Slide 7
  • Descriptive Vs Analytical In Descriptive Research, the Researcher has to only report what is happening or what has happened. In Analytical Research, the Researcher has to use the already available facts or information, and analyse them to make a critical evaluation of the subject 7 Shashikant S Kulkarni,Research Methodology An Introduction
  • Slide 8
  • Applied Vs Fundamental Applied Research, is an attempt to find solution to an immediate problem encountered by a firm, an Industry, a business organization, or the Society. Fundamental Research, is gathering knowledge for knowledges sake is called Pure or Basic. 8 Shashikant S Kulkarni,Research Methodology An Introduction
  • Slide 9
  • Quantitative Vs Qualitative Quantitative Research involves the measurement of quantity or amount. (ex: Economic & Statistical methods) Qualitative Research is concerned with the aspects related to or involving quality or Kind.(ex: Motivational Research involving behavioural Sciences) 9 Shashikant S Kulkarni,Research Methodology An Introduction
  • Slide 10
  • Conceptual Vs Empirical Conceptual Research, The Research related to some abstract idea or theory. (Ex: Philosophers and Thinkers using this to developing new concepts) Empirical Research relies on the observation or experience with hardly any regard for theory and system. 10 Shashikant S Kulkarni,Research Methodology An Introduction
  • Slide 11
  • Other Types of Research One-time or Longitudinal Research (On the basis time) Laboratory Research or Field-setting or Simulational Research (On the basis of environment) Historical Research 11 Shashikant S Kulkarni,Research Methodology An Introduction
  • Slide 12
  • Research Method Classification in Computer Science Scientific: understanding nature Engineering: providing solutions Empirical: data centric models Analytical: theoretical formalism Computing: hybrid of methods 12 From W.R.Adrion, Research Methodology in Software Engineering, ACM SE Notes, Jan. 1993
  • Slide 13
  • Scientist vs. Engineer A scientist sees a phenomenon and asks why? and proceeds to research the answer to the question. An engineer sees a practical problem and wants to know how to solve it and how to implement that solution, or how to do it better if a solution exists. A scientist builds in order to learn, but an engineer learns in order to build 13
  • Slide 14
  • The Scientific Method Observe real world Propose a model or theory of some real world phenomena Measure and analyze above Validate hypotheses of the model or theory If possible, repeat 14
  • Slide 15
  • The Engineering Method Observe existing solutions Propose better solutions Build or develop better solution Measure, analyze, and evaluate Repeat until no further improvements are possible 15
  • Slide 16
  • The Empirical Method Propose a model Develop statistical or other basis for the model Apply to case studies Measure and analyze Validate and then repeat 16
  • Slide 17
  • The Analytical Method Propose a formal theory or set of axioms Develop a theory Derive results If possible, compare with empirical observations Refine theory if necessary 17
  • Slide 18
  • The Computing Method 18
  • Slide 19
  • Empirical Method example (1) Do algorithm animations assist learning?: an empirical study and analysis Algorithm animations are dynamic graphical illustrations of computer algorithms, and they are used as teaching aids to help explain how the algorithms work. Although many people believe that algorithm animations are useful this way, no empirical evidence has ever been presented supporting this belief. We have conducted an empirical study of a priority queue algorithm animation, and the study's results indicate that the animation only slightly assisted student understanding. In this article, we analyze those results and hypothesize why algorithm animations may not be as helpful as was initially hoped. We also develop guidelines for making algorithm animations more useful in the future. 19
  • Slide 20
  • Empirical Method example (2) An empirical study of FORTRAN programs A sample of programs, written in FORTRAN by a wide variety of people for a wide variety of applications, was chosen at random in an attempt to discover quantitatively what programmers really do. Statistical results of this survey are presented here, together with some of their apparent implications for future work in compiler design. The principal conclusion which may be drawn is the importance of a program profile, namely a table of frequency counts which record how often each statement is performed in a typical run; there are strong indications that profile-keeping should become a standard practice in all computer systems, for casual users as well as system programmers. This paper is the report of a three month study undertaken by the author and about a dozen students and representatives of the software industry during the summer of 1970. It is hoped that a reader who studies this report will obtain a fairly clear conception of how FORTRAN is being used, and what compilers can do about it.. 20
  • Slide 21
  • Research Phases Informational: gathering information through reflection, literature, people survey Propositional: Proposing/formulating a hypothesis, method, algorithm, theory or solution Analytical: analyzing and exploring proposition, leading to formulation, principle or theory Evaluative: evaluating the proposal 21 R.L. Glass, A structure-based critique of contemporary computing research, Journal of Systems and Software Jan (1995)
  • Slide 22
  • Method-Phase Matrix Methods/PhasesInformationalPropositionalAnalyticalEvaluative ScientificObserve the world Propose a model or theory or behavior Measure and analyze Validate hypothesis of the model or theory; if possible repeat EngineeringObserve existing solutions Propose better solutions; build or develop Measure and analyze Measure and analyze; repeat until no further improvements possible EmpiricalPropose a model; develop statistical or other methods Apply to case studies; measure and analyze Measure and analyze; validate model; repeat AnalyticalPropose a formal theory or set of axioms Develop a theory; derive results Derive results; compare with empirical observations if possible
  • Slide 23
  • Experimenting: experiment design 23
  • Slide 24
  • Scientific method in one minute 1. Use experience and observations to gain insight about a phenomenon 2. Construct a hypothesis 3. Use hypothesis to predict outcomes 4. Test hypothesis by experimenting 5. Analyze outcome of experiment 6. Go back to step 1 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/ 24
  • Slide 25
  • Typical computer science scenario A particular task needs to be solved by a software system This task is currently solved by an existing system (a baseline) You propose a new, in your opinion, better system You argue why your proposed system is better than the baseline You support your arguments by providing evidence that your system 25 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 26
  • Running example in this lecture Text entry on a Tablet PC: A. Handwriting recognition B. Software keyboard 26 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 27
  • Why experiments? Substantiate claims A research paper needs to provide evidence to convince other researchers of the papers main points Strengthen or falsify hypotheses My system/technique/algorithm is [in some aspect] better than previously published systems/techniques/algorithms Evaluate and improve/revise/reject models The published model predicts users will type at 80 wpm on average after 40 minutes of practice with a thumb keyboard. In our experiment no one surpassed 25 wpm after several hours of practice. Gain further insights, stimulate thinking and creativity 27 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 28
  • Experiments in Computer Science 28
  • Slide 29
  • 29 Experiments in Computer Science
  • Slide 30
  • Experiment example 30 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 31
  • Research Method : Feasibility Study Metaphor: Christopher Columbus and western route to India 1. Is it possible to solve a specific kind of problem effectively ? computer science perspective (Turing test, ) engineering perspective (build efficiently; fast small) economic perspective (cost effective; profitable) 2. Is the technique new / novel / innovative ? compare against alternatives 3. See literature survey; comparative study 4. Proof by construction build a prototype often by applying on a CASE primarily qualitative; "lessons learned quantitative economic perspective: cost - benefit engineering perspective: speed - memory footprint 31 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 32
  • Feasibility Study Example : A feasibility study for power management in LAN switches We examine the feasibility of introducing power management schemes in network devices in the LAN. Specifically, we investigate the possibility of putting various components on LAN switches to sleep during periods of low traffic activity. Traffic collected in our LAN indicates that there are significant periods of inactivity on specific switch interfaces. Using an abstract sleep model devised for LAN switches, we examine the potential energy savings possible for different times of day and different interfaces (e.g., interfaces connecting to hosts to switches, or interfaces connecting switches, or interfaces connecting switches and routers). Algorithms developed for sleeping, based on periodic protocol behavior as well as traffic estimation are shown to be capable of conserving significant amounts of energy. Our results show that sleeping is indeed feasible in the LAN and in some cases, with very little impact on other protocols. However, we note that in order to maximize energy savings while minimizing sleep-related losses, we need hardware that supports sleeping. 32 http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1348125&tag=1
  • Slide 33
  • Pilot Case !Metaphor: Portugal (Amerigo Vespucci) explores western route Here is an idea that has proven valuable; does it work for us ? Proven valuable accepted merits (e.g. lessons learned from feasibility study) there is some (implicit) theory explaining why the idea has merit does it work for us context is very important Demonstrated on a simple yet representative CASE Pilot case Pilot Study Proof by construction build a prototype apply on a case 33 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 34
  • Pilot Case Example : Code quality analysis in open source software development Abstract Proponents of open source style software development claim that better software is produced using this model compared with the traditional closed model. However, there is little empirical evidence in support of these claims. In this paper, we present the results of a pilot case study aiming: (a) to understand the implications of structural quality; and (b) to figure out the benefits of structural quality analysis of the code delivered by open source style development. To this end, we have measured quality characteristics of 100 applications written for Linux, using a software measurement tool, and compared the results with the industrial standard that is proposed by the tool. Another target of this case study was to investigate the issue of modularity in open source as this characteristic is being considered crucial by the proponents of open source for this type of software development. We have empirically assessed the relationship between the size of the application components and the delivered quality measured through user satisfaction. We have determined that, up to a certain extent, the average component size of an application is negatively related to the user satisfaction for this application. 34 http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.7392
  • Slide 35
  • Comparative Study Here are two techniques, which one is better ? For a given purpose ! Where are the differences ? What are the tradeoffs ? Criteria check-list qualitative and quantitative qualitative: how to remain unbiased ? quantitative: represent what you want to know ? Often by applying the technique on a CASE Compare typically in the form of a table 35 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 36
  • Comparative Study A comparative study of fuzzy rough sets Abstract http://www.sciencedirect.com/science/article/pii/S016501140100032X 36
  • Slide 37
  • Observational Study Understand phenomena through observations Metaphor: Diane Fossey Gorillas in the Mist Systematic collection of data derived from direct observation of the everyday life phenomena is best understood in the fullest possible context observation & participation interviews & questionnaires 37 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 38
  • Observational Study Example: Action Research Action research is carried out by people who usually recognize a problem or limitation in their workplace situation and, together, devise a plan to counteract the problem, implement the plan, observe what happens, reflect on these outcomes, revise the plan, implement it, reflect, revise and so on. Conclusions primarily qualitative: classifications/observations/ 38 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 39
  • Literature Survey What is known ? What questions are still open ? Systematic comprehensive precise research question is prerequisite defined search strategy (rigor, completeness, replication) clearly defined scope criteria for inclusion and exclusion specify information to be obtained the CASES are the selected papers 39
  • Slide 40
  • Formal Model How can we understand/explain the world ? make a mathematical abstraction of a certain problem analytical model, stochastic model, logical model, re-write system,... prove some important characteristics Example : A Formal Model of Crash Recovery in a Distributed System Abstract A formal model for atomic commit protocols for a distributed database system is introduced. The model is used to prove existence results about resilient protocols for site failures that do not partition the network and then for partitioned networks. For site failures, a pessimistic recovery technique, called independent recovery, is introduced and the class of failures for which resilient protocols exist is identified. For partitioned networks, two cases are studied: the pessimistic case in which messages are lost, and the optimistic case in which no messages are lost. In all cases, fundamental limitations on the resiliency of protocols are derived. 40 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 41
  • Simulation What would happen if ? study circumstances of phenomena in detail simulated because real world too expensive; too slow or impossible make prognoses about what can happen in certain situations test using real observations, typically obtained via a CASE Examples distributed systems (grid); network protocols too expensive or too slow to test in real life embedded systems simulating hardware platforms impossible to observe real clock-speed / 41 http://win.ua.ac.be/~sdemey/Tutorial_ResearchMethods/
  • Slide 42
  • Back to our example Why this experiment? Despite decades of research there is no empirical data of text entry performance of handwriting recognition An inappropriate study of handwriting (sans recognition) from 1967 keeps getting cited in the literature, often through secondary or tertiary sources (handbooks, etc.) Based on these numerous citations in research papers, handwriting recognition is perceived to be rather slow However, there is no empirical evidence that supports this claim 42 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 43
  • Controlled experiments and hypotheses A controlled experiment tests the validity of one or more hypothesis Here we will consider the simplest case: One method vs. another method Each method is referred to as a condition The null hypothesis H0 states there is no difference between the conditions Our hypothesis H1 states there is a difference between the conditions To show a statistically significant difference the null hypothesis H0 needs to be rejected 43 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 44
  • Choice of baseline A baseline needs to be accepted by your readers as a suitable baseline Preferably the baseline is the best method that is currently available In practice a baseline is often a standard method which is well understood but often not representative of the state-of-the- art 44 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 45
  • Example Our example, two conditions 45 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 46
  • Why this baseline? The software keyboard is well understood Many empirical studies of their performance Also exists expert computational performance models The software keyboard is the de-facto standard text entry method on tablets The literature compares handwriting recognition text entry performance against measures of the software keyboard 46 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 47
  • Aim of controlled experiment To measure effects of the different conditions To control for all other confounding factors To be internally valid To be externally valid To be reproducible 47 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 48
  • Experimental design Dependent and independent variables Within-subjects vs. between-subjects Mixed designs Single session vs. longitudinal experiments 48 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 49
  • Dependent and independent variables Dependent variable: What is measured Typical examples (in CS): time, accuracy, memory usage Independent variable What is manipulated Typical examples (in CS): the system used by participants, feedback to participant (e.g. a beep versus a visual flash) 49 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 50
  • Deciding what to manipulate and what to measure This is a key issue in research Boils down to your hypothesis: What do you believe? How can you substantiate your claim by making measures? What can you measure? Is it possible to protect internal validity without sacrificing external validity? 50 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 51
  • Our example We let participants write phrases using either: Software keyboard (baseline) Handwriting recognition That is, we manipulate the input method We measure: Entry rate in words-per-minute Error rate in number of written characters that do not match the stimulus 51 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 52
  • Between-subjects design Each participant is exposed to only one condition For example, in a study examining the effect of Bayer aspirin vs Tylenol on headaches, we can have 2 groups (those getting Bayer and those getting Tylenol). Participants get either Bayer OR Tylenol, but they do NOT get both. One of the simplest experimental designs Advantages: No risk of confuse or skill-transfer from one condition to the other Therefore no need to do counter-balancing or check for asymmetrical skill-transfer effects Disadvantages: Variance is not controlled within the participant Therefore demands more participants than a within-subjects design to show a statistically significant difference 52 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 53
  • Within-subjects design Each participant is exposed to all conditions One of the most common experimental designs in practice Advantages: Variance is controlled within the participant Therefore requires fewer participants than a between-subjects design Disadvantages: More involved, requires counter-balancing of start condition to avoid transfer effects Risk of asymmetrical skill transfer 53 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 54
  • Mixed designs It is also possible to combine within- and between-subjects experimental designs Such designs are called mixed designs These are difficult to design because they are more difficult to control A mixed design can be a symptom of no clear set of hypotheses, or lack of ability to prioritise among them Often a mixed design can be broken down into smaller studies that study isolated phenomena separately 54 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 55
  • Single session vs. longitudinal Do you believe participants will improve significantly over time? If so, how much will they improve? How are previous related studies set up in the literature? 55 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 56
  • Method Participants Apparatus Procedure 56 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 57
  • Participants How many? How is the sample constructed? Is it representative of the population we believe will use the interface? Are potential problematic confounds taken care off? Did participants receive any compensation? Was the study approved by the university ethics committee? [if applicable] 57 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 58
  • Our example We recruited 12 volunteers from the university campus. We intentionally wanted a rather broad sample and recruited participants from many different departments with many different backgrounds. Six were men and six were women. Their ages ranged between 22-37 (mean = 27, sd = 4). Participants were screened for dyslexia and repetitive strain injury (RSI). Seven participants were native English speakers and five participants had English as their second language. No participant had used a handwriting recognition interface before. One participant had used a software keyboard before. No participant had regularly used a software keyboard before. Participants were compensated 10 per session. 58 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 59
  • Apparatus Which equipment and which software? Needs to be described in sufficient detail to enable other researchers to replicate your experiment Typical information: Physical and logical screen size Sensor device characteristics CPU clock speed Computer brand/model Choices that are not obvious need to be motivated 59 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 60
  • Apparatus, our example We used a Dell Latitude XT Tablet PC running Windows Vista Service Pack 1. The 12.1" color touch-screen had a resolution of 1280 800 pixels and a physical screen size of 261 163 mm. Participants used a capacitance-based pen to write directly onto the screen in both conditions. Both the handwriting recognizer and the software keyboard were docked to the lower part of the screen. The dimensions of the software keyboard were 1266 244 pixels and 257 50 mm. The dimensions of the handwriting recognizer writing area measured 1266 264 pixels and 257 55 mm. 60 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 61
  • Procedure Describes how the experiment was carried out Needs to be described in sufficient detail for other researchers to be able to replicate your experiment Again, choices need to be motivated 61 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 62
  • Procedure, our example The experiment consisted of one introductory session and ten testing sessions. In the introductory session the experimental procedure was explained to the participants. Participants were shown how to use the software keyboard and the handwriting recognizer, including demonstrations of how to correct errors. Each testing session lasted slightly less than one hour. Testing sessions were spaced at least 4 hours from each other and subsequent testing sessions were maximally separated by two days. In each testing session participants did both conditions (software keyboard and handwriting recognition). The order of the conditions alternated between sessions and the starting condition was balanced across participants. Each condition lasted 25 minutes. Between conditions there was a brief break. Participants were also instructed that they could rest at any time after completing an individual phrase. 62 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 63
  • Procedure, our example In each condition participants were shown a phrase drawn from the phrase set provided by MacKenzie and Soukoreff [8]. Each participant had their own randomized copy of the phrase set. Participants were instructed to quickly and accurately write the presented phrase using either the software keyboard or the handwriting recognizer. Participants were instructed to correct any mistakes they spotted in their text. In the handwriting condition we instructed participants to write using their preferred style of handwriting (e.g. printed, cursive or a mixture of both). After they had written the phrase they pressed a Submit button and the next phrase was displayed. The Submit button was a rectangular button measuring 248 16 mm. It was placed 9 mm above the keyboard and handwriting recognizer writing area. 63 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 64
  • After the experiment Results Limitations and implications 64 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 65
  • Our Example 65 http://www.cl.cam.ac.uk/teaching/0910/C00/L10/
  • Slide 66
  • Summary A well-designed controlled experiment provides you empirical evidence that your new method is better [in some aspects] than some previous method in the literature (a baseline) Important to consider the experimental design early Within vs. between Dependent and independent variables Internal and external validity Pilot study often a good idea (perhaps your method has a fatal flaw) Important to point out limitations and implications Experiments must be reproducible http://www.cl.cam.ac.uk/teaching/0910/C00/L10 66