Professional Documents
Culture Documents
Chapter 1: Introduction
What is NOT research?
o
o
o
What is research?
o
o
o
Definition: Something that people undertake in order to find out things in a systematic way,
thereby increasing their knowledge.
Transdisciplinary nature
Development of ideas that are related to practice Requirement to have some practical
consequence
Personal or commercial advantages related to research
Theory + practice (?)
FUNDAMENTAL RESEARCH
Purpose:
o Expand knowledge of processes
o Universal principles
o Findings of significance and value to
society in general
o Find rules that might explain a theory
(E.g.: How students are motivated)
APPLIED RESEARCH
Purpose:
o Improve understanding of particular
problem
o Results in solution to problem
o New knowledge limited to problem
o Findings of particular relevance
o To solve practical problems
(E.g.: lack of motivation in a company)
Context:
o Universities (Mostly academic context)
o Choice determined by researcher
o Flexible time scales
Context:
o Organisations and universities
(E.g.: Consultancy)
o Negotiation with originator
o Tight time scales
Wherever your research lies, whether fundamental or applied, or anywhere in between, you should
undertake it with rigour.
Pay careful attention to the research process!
Formulate and clarify the research topic (What: Formulate the question. Make a
good/relevant question)
Develop the research design. (How am I going to develop the research? Who are my target
group?)
Gather data
Analyse and interpret the data
Write the project report (Write research work with an answer to question)
Intellectual reasons
Practical reasons
Personal reasons
No omnivore-research question
o
o
Not everything can be investigated, and what you investigate cannot be known to perfection
How many times do Flemish people go to the cinema to watch Flemish movies?
Problem statement:
Company X deals with significant financial problems. To solve these problems, the turnover should
increase with at least 10 percent during the next 18 months.
Research questions?
o
o
Problem statement:
Previous academic research thoroughly investigated the shoplifting phenomenon as it negatively
influences business, other consumers, and society more generally. Although many examined the
socio-demographic profile of shoplifters, research about how to prevent shoplifting among
consumers is rather scarce.
Research questions?
o
o
DESCRIPTIVE RESEARCH
o
o
o
o
EXPLANATORY/CAUSAL RESEARCH
To establish relationships between variables
o
o
E.g.: Quantitative study investigating whether certain colours in the shop lay-out result in
higher levels of customer satisfaction
E.g.: Qualitative study investigating whether Corporate Social Responsibility activities in a
company influence employee involvement
Gender
Independent
Variable
Study Results
Discipline
(Mediator)
Dependent
variable
Gender
Independent
variable
Results
Belgium
v.s
Other countries
(Moderator)
Dependent
variable
Question:
Which of the following statements is false?
A) Profiling KUL-students in terms of gender and age is an example of descriptive research.
B) Exploratory research may follow descriptive or causal research.
C) When little is known about the problem situation, it is desirable to start with exploratory
research.
D) Investigating whether and why a decrease in price influences sales and market share results
in descriptive research.
Answer: D
RESEARCH OBJECTIVES
Might be useful for complex research questions. It comes after the question.
What?
Operationalise how you intend to conduct your research by providing a set of coherent and
connected steps to answer your research question.
Why?
o
o
What kind of work do I need to do in order to answer my research question? What successive steps
do I need to take in order to answer my research question?
These are statements, not questions and they are numbered in a list.
Example:
As a sales manager, you notice that your sales staff becomes less and less motivated to sell the
companys products. Therefore, you decide to investigate in which way you could increase the level
of motivation among your sales staff.
To define the concept of motivation
To review key literature on the existing measures to motivate sales people
To identify the strengths and weaknesses of the identified measures
To determine which measures are most relevant to use in the context of my company
To carry out primary research in my company to measure the effectiveness of the selected
measure
HYPOTHESES
It is something you would like to test. It is based on a theory, but not always. E.g.: For inductive
approach.
Directional hypotheses:
o
o
The direction of the relationship between the variables is indicated. E.g.: The greater the
stress experienced in the job, the lower the job satisfaction of employees
The difference between two groups on a variable is postulated. E.g.: Women are more
motivated than men to lose weight
Non-directional hypotheses:
o
E.g.: Belgians consume much candy, Americans dont. (should avoid using words like much)
No point of reference
o
However, this would not be problematic when following the inductive approach (see
afterwards)
Hypothesis
o
A tentative, yet testable, statement which predicts what you expect to find in your
(empirical) data. E.g.: Financial punishments are more effective in preventing consumers
from shoplifting compared to social punishments.
Question:
Is the following hypothesis well formulated? Explain your answer.
Belgian adolescents have a better self-image compared to French adolescents.
Answer: This is a good directional hypothesis. Because it is well formulated and there is no
ambiguous word and with is a clear point of reference.
Question:
Research topic: Salespeople of Samsung and their preference for payment by commission vs. salary
Formulate a research question in line with the research topic above that results in descriptive
research and a research question that results in causal research.
Answer:
Descriptive Research: What percentage prefer payment by salary versus payment by commission?
Causal Research: How many products are sold by salespeople paid by salary in comparison to
salespeople paid by commission?
RESEARCH PARADIGM
The development of your research design will be influenced by your research paradigm
o
o
A cluster of beliefs and dictates which for scientists in a particular discipline influence what
should be studied, how research should be done and how results should be interpreted
A basic belief system or worldview that guides the investigator, not only in choices of
method but in ontologically and epistemologically fundamental ways
The difference between research paradigms is based on assumptions within three domains:
Ontology:
What is the reality? How does reality look like? Is there a reality external to
humans? If yes, what does it look like?
Epistemology: How can we built knowledge about that reality? How do we know what we know?
What counts as knowledge, what doesnt? How is the relationship between
researcher and subject?
Methodology: How can the researcher acquire knowledge about his beliefs? (Is limited by
ontological and epistemological viewpoints.)
POSITIVISM
o
o
o
o
o
o
o
Positivism
Explaining relationships
Accumulating data
Objective process
Knowledgeable researcher, known
subjects
Theory verification
Deduction Testing hypotheses
Focus on quantitative research
CONSTRUCTIVISM
o
o
o
o
o
o
o
Constructivism
Understanding subjects meaning
Constructing information
Intersubjective process
Researcher becomes involved with the
subjects
Theory building
Induction Developing hypotheses
Focus on qualitative research
There are many other research paradigms in between the extremes of positivism and
constructivism!
It could happen that a qualitative research could be positivism.
Example:
Relationship between CSR activities within a company and employees involvement.
How would a positivist deal with this topic?
He would test a theory. E.g.: Giving the employees a quantitative questionnaire.
How would a constructivist deal with this topic?
He wouldnt start with theory. Depending on the field of study, he would develop a theory.
Question:
Which of the following statements is false?
A) Quantitative research can follow an inductive approach.
B) Qualitative research might be inductive as well as deductive.
C) In positivistic research, the researcher intervenes in the research process.
D) Examining peoples motivation for luxury consumption can be done by quantitative as well
as qualitative research.
Answer: C
B Might be true sometimes. But not always. Not typical. C Qualitative is more appropriate but we
can have a quantitative approach as well. (E.g. checking consumption)
RESEARCH APPROACH
The development of your research design will also be influenced by your research approach
Theory
Induction
Data
Deduction
Once youve collected data and made your theory, you can decide to test it again.
Deductive or Inductive?
1. It rains, everything outside becomes wet. It rains. The car is outside.
The car will become wet.
Answer: Deduction
2. The first duck in the park is brown. The second duck in the park is brown. The third duck in
the park is brown.
Every duck in the park is brown.
Answer: Induction
Some practical criteria:
o
o
o
o
o
Inductive approach is more risky. You might not develop a good theory with your data.
Question:
Which of the following statements is true?
A) With deduction, data are collected and a theory developed as a result of the data analysis.
B) Research projects should include either the deductive or inductive research approach.
C) A research topic about which little literature exists, is more likely to result in an inductive
research approach than a deductive research approach.
D) The deductive research approach is less strict compared to the inductive research approach.
Answer: C
Patrick is a member of the Human Relations Research Group of KUL. He read about the large amount
of adolescents slipping into shoplifting behaviour and wonders how this behaviour could be
prevented. Therefore, he runs a study in which he tests whether the Protection Motivation Theory is
applicable to this particular issue. Patricks study leans towards:
a) An inductive research approach
b) A deductive research approach
Answer: b
He develops some expectations/theory. Then he gathers data to test the theory.
In sum, you should be aware of the fact that research paradigm and research approach influence
your research design
Core elements of your research design are:
o Research choice
o Research strategy
o Time horizon
Still, it is possible that a quantitative research design is more in line with induction, and that a
qualitative research design is more in line with deduction Many research designs are thus likely to
combine quantitative and qualitative elements. No need to learn figure by heart.
Triangulation is one of the advantages of using more than one data collection technique and
analysis procedure
Multiple methods may be used in order to combine data to ascertain if the findings
from one method mutually corroborate the findings from the other method. ( to see
whether we have the same result)
RESEARCH STRATEGY:
We have different ways of conducting a research. We can combine different strategies within the
same project.
Various research strategies exist:
o
o
o
o
o
o
o
o
Experiment
Survey
Archival Research
Case Study
Ethnography
Action research
Grounded theory
EXPERIMENT
The only way to investigate Causal Relationship. To infer whether a change in one or more
independent variables produces a change in one or more dependent variables.
E.g.:
Independent
Mood (A)
Negative
Dependent
Creativity (B)
Positive
CLASSIC EXPERIMENT
o
o
Pre-test
measurement of
Purchasing
Behaviour
Post-test
measurement
of
Purchasing
Behaviour
Example:
Independent
Dependent
Promotion
Purchasing Behaviour
No promotion
Control group
Promotion
Experimental group
Internal Validity:
The extent to which the findings can be attributed to the
interventions rather than any flaws in your research design.
External Validity:
Whether the cause-and-effect relationship(s) found in the
experiment can be generalised.
Question:
o
As a marketer, you are wondering whether rock versus pop music in supermarkets
influences the time consumers spend in these supermarkets.
Design an experiment which would enable this marketeer to find an answer on this
problem.
Answer:
Independent
Music
Dependent
Time spent in supermarket
ARCHIVAL RESEARCH
o
Analysis of administrative records and documents as principal source of data because they
are products of day-to-day activities
Secondary data analysis: Data are part of the reality being studied rather than having been
collected originally as data for other (research) purposes
Allows research questions which focus upon the past and changes over time to be
answered
Disadvantages might be the nature of the records and documents, missing data, and access
to data (confidentiality)
CASE STUDY
o
The boundaries between the phenomenon being studied and the context within which it is
being studied are not clearly evident
Experiment: Research undertaken in a highly controlled context
Survey: Ability to explore and understand the context is limited by the
number of variables for which data can be collected
Has considerable ability to generate answers to why, what and how questions
TRIANGULATION
Example:
o
o
o
o
o
Building high quality interaction and cooperation during organisational change (Grieten &
Lambrechts, 2007, 2009)
Problem definition: 2/3 of change processess fails, although it is known that these failures
are often caused by relational aspects
Research question: What makes relational practices of such a quality that they improve
common progress during organisational change?
Case selection: Two organisations with contrasting change processes in terms of results
(best practice and worst practice), but similar in terms of relational approach
Data collection methods: (participant) observation, in-depth interviews, focus groups,
document analysis (Triangulation!)
ETHNOGRAPHY
o
Used for studying people in groups, who interact with one another and share the same
space (e.g., street level, work group, organisation, )
Focuses upon describing and interpreting the social world through first-hand field study
o
Researchers living amonst those whom they study, to observe and talk to them in
order to produce detailed cultural accounts of their shared beliefs, behaviors,
interactions, language, rituals and the events that shaped their lives
ACTION RESEARCH
o
An emergent and iterative process of inquiry that is designed to develop solutions to real
organisational problems through a participative and collaborative approach, which uses
different forms of knowledge, and which will have implications for participants and the
organisation beyond the research project
Demanding strategy in terms of the intensity involved and the resources and time required
GROUNDED THEORY
Uses Inductive research approach. We start with the data and theory/a relevant model.
o
Theory is developed through the systematic and simultaneous process of data collection and
analysis involving a mainly inductive approach
to generate theory grounded in your data
Example:
Nyilasy, G., & Reid, L.N. (2009). Agency practitioners metatheories of advertising. International
Journal of Advertising, 28(4), 639-668.
o
What do advertising agency practitioners think about how advertising works? This
studys basic aim was to understand practitioners thinking about the work of
advertising in their own terms. As there was little substantive research of this
perspective, a grounded theory approach to qualitative research was used.
TIME HORIZON:
Cross-sectional studies
o
Longitudinal studies
o
Be careful for relevant changes in variables you do not take into account!
When developing your research design, you should also consider the ethnics and the quality of your
research design.
Reliability: Consistency in research. Is it consistent when I replicate exactly the same experiment.
Validity: Testing the right variable.
73 kg
73 kg
73 kg
73 kg
Real weight= 78 kg
VALIDITY
It doesnt measure what it intends to measure.
RELIABILITY
Reliability does not involve validity!!! &
Not Reliable
Not Valid
Reliable
Not Valid
Not Reliable
Valid
Reliable
Valid
Question:
1. The student administration department of HUB examines the extent to which HUB-students
are satisfied with the teaching skills of the HUB-staff. By means of a questionnaire on Time 1,
researcher X finds that the overall satisfaction is equal to 8.7 on 10. Two weeks later (Time
2), researcher X conducts the same research (among the same respondents) and finds that
the overall satisfaction is equal to 8.7 on 10. Consequently, researcher Xs results are:
Valid: ? => No information to tell us whether it is valid or not.
Reliable: Yes
2. You developed a measurement instrument to examine employees level of job autonomy
perception (i.e., the extent to which they experience autonomy in their job). This
measurement instrument seems to be sensitive to social desirability (i.e., respondents
tendency to give answers that may be desirable from a social standpoint / when people
answer according to what they think is expected of them and not according to their own
opinion.)
Question: What is the implication of social desirability for the quality of your measurement
instrument?
Not valid because of measuring job autonomy, they are asking/measuring social desirability.
3. Which of the following statements is correct?
A) Experiments are more valid compared to surveys
B) If a study is reliable, it means that it measures what we think it should measure
C) External validity is about the extent to which the reliability of a study can be
generalised
D) An interviewer who writes down a wrong answer from absent-mindedness threats
the reliability of his study
Answer: D => Just one instance of absent-mindedness will not influence the validity of
the research.
Chapter 4: Sampling
Sampling is about selecting a number of elements from a populaton you would like to study, with
the intention to derive characteristics of the population from characteristics of the sample.
What is the normal range in miles that can be travelled by electric cars in everyday use?
Population: Electric cars you use everyday
Checklist:
Are elements listed in the sampling frame relevant to your research question?
How recently was the sampling frame compiled, in particular is it up to date?
Does the sampling frame includes all elements, in other words is it complete?
Does the sampling frame contain the correct information, in other words is it accurate?
Does the sampling frame exclude irrelevant cases, in other words is it precise?
For purchased lists and online panels, can you establish and control precisely how the
sample will be selected?
For an online panel, can you establish whether incentives will be used to enhance the likely
response and provide an assessment of the impact of this on respondent characteristics and
consequently responses?
You should not generalise beyond your sampling frame
E.g.: Sampling frame consists of all employees of an organisation You can only generalise to
employees of that particular organisation
Sometimes not possible (or very hard) to develop a sampling frame!
Question:
Which sampling frame is suited for the following research questions?
How do employees of Carrefour think the proposed introduction of compulsory Sunday working will
affect their working lives?
Answer:
Which factors influence Belgian lawyers decision to work in other European countries?
Answer:
SELECT SAMPLING TECHNIQUES
First of all you need to decide whether you will examine all elements of the population (=census) or
you will dram a sample
For populations fewer than 50, it is usually more sensible to collect data from the entire population.
Draw a sample => Conditions:
o Practical constraints
o Budget constraints
o Time constraints
o Access constraints
o Results need to be quickly available
o Testing includes destroying of population (e.g.: Establish the actual duration of long-life
batteries)
Two types of sampling:
1. Probability Sampling
2. Non-probability Sampling
The probability of each case being selected from the total population is not known.
It is impossible to answer research questions that require you to make statistical inferences
about the characteristics of the population.
Note: You may still be able to generalise from non-probability samples about the
population, but not on statistical grounds.
Question:
Which of the following statements is true?
A) With probability samples the chance, or probability, of each case being selected from the
population is unknown.
B) Generalizations about populations from data collected using any probability sample are
based on intuition.
C) Sampling provides a valid alternative to a census when it would be impracticable for you to
survey the entire population.
D) The sampling frame gives an overview of all the elements which will be included in your final
sample.
Answer: C
Probability Sampling Techniques
o Simple random sampling
o Systematic random sampling
o Stratified random sampling
o Cluster sampling
o Multi-stage sampling
If the same number is read off a second time, it must be disregarded as you need different cases.
This means that you are not putting each cases number back into the sampling frame after it has
been selected. This is termed sampling without replacement.
If a number is selected that is outside the range of those in your sampling frame, you simply ignore it
and continue reading off numbers until your sample size is reached.
Disadvantages of this procedure:
o Time-consuming
o Requires adapted table with sufficient radom numbers
Other random procedure:
o Computer generated random numbers / Online random number generator ( random
number tables)
o Random telephone numbers
- Often used when doing computer-aided telephone interviewing (CATI)
- Dialing telephone numbers at random from an existing database
- Or random digit dialling
+
Does not consider the telephone book
Some households have more than one telephone number!
Simple random sampling:
o Sample without (systematic) bias
o Best used when you have an accurate and easily accessible sampling frame that lists the
entire population
Disadvantage: These lists are not always available!
o If your population covers a large geographical area, random selection means that selected
cases are likely to be dispersed throughout the area
Disadvantage: This sample is not suited if collecting data over a large geopgraphical
area using a method that requires face-to-face contact (high travel costs)
Example:
Jemma was undertaking her work placement at a large supermarket, where 5011 of the
supermarkets customers used the supermarkets Internet purchase and delivery scheme. She was
asked to interview customers and find out why they used this scheme. As there was insufficient time
to interview all of them, she decided to interview a sample using the telephone. Her calculations
revealed that to obtain acceptable levels of confidence and accuracy she needed an actual sample
size of approximately 360 customers. She decided to select them using simple random sampling.
Having obtained a list of Internet customers and their telephone numbers, Jemma gave each of the
cases (customers) in this sampling frame a unique number. In order that each number was made up
in exactly the same way she used 5011 four-digit numbers starting with 0000 through 5010. So
customer 677 was given the number 0676.
She selected at random a first random number in the random number table. After that, she read off
the other random numbers in a regular and systematic manner. She continued in this manner until
360 different cases had been selected. These formed her random sample. Numbers selected that
were outside the range of those in her sampling frame (such as 8321, 5953 and 7932) were simply
ignored.
Similar to Simple random sampling but in a systematic order. We apply an interval for sample
selection.
Example:
Number each of the cases in your sampling with a unique number (0, 1, 2 )
1500 patients: number each of these patients (0,1,2 1499) Sample of 300
participants
Calculate the sampling fraction (actual sample size/total population)
Sampling fraction: 300/1500=1/5
Select the first case using a random number (depends on sampling fraction)
Random starting point (i.e., random number between 0 and 4)
Select subsequent cases systematically (until sample size is reached) using the sampling fraction to
determine the frequency of selection.
Continue to select every fifth patient until the sample size of 300 patients is reached.
Systematic random sampling:
o Sometimes not necessary to develop a sampling frame (e.g., every tenth visitor of a
website)
o Easy to understand and to explain
o Despite these advantages, be careful when using existing lists as sampling frames
- You need to ensure that the lists do not contain period patterns! (See next 2 slides)
- Systematic random sampling is suitable for geographically dispersed cases only if
you do not require face-to-face contact when collecting data ( simple random
sampling)
The impact of period patterns on systematic random sampling:
Consider the use of systematic random sampling to generate a sample of monthly sales from the
Harrods store in London. The sampling frame contains monthly sales for the last 60 years. A
sampling interval of 12 is chosen.
A high street bank needs you to administer a questionnaire to a sample of individual customers with
joint bank accounts
Sampling fraction = 1/2 = you will need to select every second customer on the list
The names of the customer list, which you intend to use as the sampling frame, are arranged as
depicted below:
She decided to select a systematic random sample. A sampling fraction of 1/4 meant that she
needed to select every fourth customer on the list. As indicated by the ticks, random numbers were
used to select the first case in the public sector (001) and private sector (003) strata. Subsequently,
every fourth customer in each stratum was selected.
o
o
Dividing the population into a series of relevant strata means that the sample is more likely
to be representative, as you can ensure that each of the strata is represented proportionally
within your sample.
Proportionate stratified random sampling = the sample size drawn from the strata
are proportionate to the stratas share of the total population
Disproportionate stratified random sampling (oversampling enables separate analyses)
Despite the advantages of proportionate and disproportionate sampling, there are some
disadvantages as well:
- Only possible if you can easily distinguish significant strata (in your sampling frame)
- Extra stage of sampling procedure
More time
More expensive
More difficult to explain compared to simple and systematic random
sampling
Cluster Sampling
All elements of a number of randomly selected clusters are selected
More concrete:
o Choose the cluster grouping for your sampling frame
- Heterogeneity in clusters is important! Cluster small universe
(e.g., Population=football lovers; Cluster=football stadium)
o Number each of the clusters with a unique number (0, 1, )
o Select your sample of clusters using some form of random sampling
o Select all elements of the selected clusters
Multi-stage sampling
Select a stage and research within the cluster.
o
Modifying a cluster sample by adding at least one more stage of sampling that also involves
some form of random sampling
o Procedure:
Choose the cluster grouping for your sampling frame
Heterogeneity in clusters is important!
Number each of the clusters with a unique number (0, 1, )
Randomly select a number of clusters
Repeat the above steps (e.g., districts cities neighbourhoods streets)
Randomly select elements of the most recently selected clusters
Example:
Laura worked for a market research organisation that needed her to interview a sample of 400
households in England and Wales. She decided to use the electoral register as a sampling frame.
Laura knew that selecting 400 households using either systematic or simple random sampling was
likely to result in these 400 households being dispersed throughout England and Wales, resulting in
considerable amounts of time spent travelling between interviewees as well as high travel costs. By
using multi-stage sampling, Laura fest these problems could be overcome.
In her first stage, the geographical area (England and Wales) was split into discrete sub-areas
(counties). These formed her sampling frame. After numbering all the counties, Laura selected a
small number of counties using simple random sampling. Since each case (household) was located in
a county, each had an equal chance of being selected for the final sample.
As the counties selected were still too large, each was subdivided into smaller geographically
discrete areas (electoral wards). These formed the next sampling frame (stage 2). Laura selected
another simple random sample. This time she selected a larger number of wards to allow for likely
important variations in the nature of households between wards.
A sampling frame of the households in each of these wards was then generated using a combination
of the electoral register and the UK Royal Mails postcode address file. Laura finally selected the
actual cases (households) that she would interview using systematic random sampling.
Multi-stage sampling:
Advantages:
o Geographically dispersed population becomes possible against lower cost.
o Compared to normal cluster sampling, larger clusters with many cases is possible
Disadvantages:
o Selecting smaller and smaller subgroups might impact the representativeness of your sample
Can be solved through applying stratified random sampling techniques as well
Question:
o
BNP Paribas Fortis has about 400 000 Benelux-clients using their credit card. The credit card
application form contains common information such as name, address, age, telephone
number, educational level, etc.
BNP Paribas Fortis wants to examine whether there is a relationship between the way in
which credit cards are used (e.g., frequency of use) and the socio-economic profile of its
users.
Questions: Identify the population and the sampling frame. Consider the suitability
of the various probability sampling techniques in this situation.
Answer:
Frequency Control:
Representative in terms of criterion
Question:
An association has 750 members. In the table below, the distribution of these members is given in
terms of gender and age.
Quota Sampling:
Advantages compared to probability sampling techniques
o Less costly
o Can be set up very quickly
o Does not require a sampling frame
Disadvantages
o Because the interviewer can choose within quota boundaries whom they interview, your
quota sample may be subject to bias (e.g., easily accessible respondents who appear to be
willing to answer the questions)
o As the sample is not probability based, you cannot measure the level of certainty or margins
of error
Judgemental Sampling
o
o
o
= Purposive sampling
You need to use your judgment to select cases that will best enable you to answer your
research question
Often used when:
- Working with very small samples (such as in case study research or when you wish
to select cases that are particularly informative)
E.g., Industrial research among experts
- Doing qualitative research
- Doing exploratory research
Snowball Sampling
Commonly used when it is difficult to identify members of the desired population
Procedure:
o Make contact with one or two cases in the population
o Ask these cases to identify further cases
o Ask these new cases to identify further new cases (and so on)
o Stop when either no new cases are given or the sample is large enough (or when theoretical
saturation is reached)
Main problem = Making initial contact
Problems of bias is huge
o Respondents are most likely to identify other potential respondents who are similar to
themselves, resulting in a homogeneous sample
Self-selection Sampling
o
o
It occurs when you allow each case, usually individuals, to identify their desire to take part in
the research
You therefore:
- Publicise your need for cases either by advertising or by asking them to take part
- Collect data from those who respond
Problem = representativeness
- Cases that self-select often do so because of their feelings or opinions about the
research question
Convenience Sampling
o
o
o
Involves selecting cases haphazardly only because they are easily available (or most
convenient) to obtain for you sample
- E.g., the person interviewed at random in a shopping centre for a television
programme
Widely used
Advantages:
- Cheap
- Quick (Suited for exploratory research)
Though prone to bias and influences that are beyond your control Cases appear in the
sample only because of the ease of obtaining them. Bias decreases as the population
becomes more homogeneous
Question:
For the following research question, it has not been possible for you to obtain a sampling frame.
Suggest the most suitable sampling technique to obtain the necessary data, and motivate your
choice.
Research question: Would users of the tennis club be prepared to pay a 10 per cent increase in
subscriptions to help fund two extra tennis courts? You need the answer by tomorrow morning.
Answer:
Convenience sample (not much time)
But if we have time, probability sample technique
EXAM: If we have sampling frame => Probability technique
For many research projects, you will have to combine different sampling techniques
Question:
Is the following statement true or false? Motivate your answer
Stratified sampling can be seen as random quota sampling
Answer:
o
o
o
o
o
o
o
We are thus interested in using the sample statistic (e.g., the sample mean) as an estimate
of the value in the population
An approach to assessing the accuracy of the sample mean as an estimate of the mean in
the population is to calculate boundaries (Confidence Intervals) within which we believe the
true value of the mean will fall
Typically, we look at 95% confidence intervals
This means that for 95% of the time, the true value of the population will fall within the
boundaries of the confidence interval
In other words, if you would collected 100 samples, calculated the mean and then calculated
a confidence interval for that mean, then for 95 of these samples, the confidence intervals
we constructed would contain the true value of the mean in the population
X = sample mean
= population mean
= standard deviation of population
n = sample size
Confidence level (Z)
The range of a normally distributed variable is approximately equal to +/- 3 standard deviations, and
one can thus estimate the standard deviation by dividing the range by 6.
Example:
Suppose a researcher wants to estimate the monthly household savings more precisely so that the
estimate will be within +/- 5 of the true population value. What should be the size of the sample?
1) The researcher should specify the level of precision. This is the
maximum permissible difference between the sample mean
and the population mean.
D=5
2) The researcher should specify the level of confidence and determine the z-value associated
with this confidence level
Confidence level=95% => z-value=1.96
3) The researcher should determine the standard deviation of the population.
Secondary sources indicate a standard deviation of 55 (=)
D=5
z=1.96
=55
Sample size:
The larger the population, the larger the sample size
The higher the degree of confidence, the larger the sample size
The higher the degree of precision, the larger the sample size
The choice of sample size is thus governed by:
o The confidence you need to have in your data: The level of certainty that the characteristics
of the data collected will represent the characteristics of the total population
o The margin of error that you can tolerate: The accuracy you require for any estimates made
from your sample
o The variability in the population in terms of the variable(s) of interest
EXAM questions:
A big company wants to know how much money (in euro) each of its managers spends on lunches
per month. It is known that the maximum amount of money spent is 700 euros while the minimum
is 400 euro. The company wants the result to be accurate in terms of 5 euro and wants to make a
prediction with a confidence level of 95%.
How large should the sample size be?
Answer:
Level of precision: D = 5
Confidence 95%: z = 1.96
= 700-400/6 = 50
N= (1.96 * 50) / 5 = 385
? Population proportion?
Secondary sources, pilot study, or
conservative (=0.5)
Example:
Suppose a researcher is interested in estimating the proportion of households in a particular region
that have bought clothes online. What should be the sample size?
1) The researcher should specify the level of precision. This is the maximum permissible
difference between the sample proportion and the population proportion.
D = 0.5
2) The researcher should specify the level of confidence and determine the z-value associated
with this confidence level.
Confidence level=95% z-value = 1.96
3) The researcher should determine the population proportion.
Secondary sources indicate a population of 0.64
D = 0.05
z = 1.96
= 0.64
EXAM Question:
A researcher wants to know the percentage of households that has a loyalty card of a certain
supermarket. You desire a precision level of 5 percentage points (and a 95% confidence level).
How large should the sample size be?
Level of precision: D = 5
Confidence 95%: z = 1.96
= 50
Non-probability sampling
o
o
o
First of all, determine the minimal sample size (taking into account certain
confidence and precision levels)
n = n x 100 / re%
Once data are collected from a sample, comparisons between the structure of the sample
and the structure of the population should be made
If it is found that the structure of a sample does not match the target population (due to
population specification error, sampling frame error, sample selection bias, non-response
bias), a weighting scheme can be used
- A statistical procedure that attempts to account for these errors/biases by assigning
differential weights to the data depending on the response rates
Weighing
Often used in research projects that also collect primary data (But you can also use them on
their own or with other sources of secondary data!)
Include text materials and non-text materials. (can be nice to create a background with text)
Can be analysed both quantitatively and qualitatively
Can be used to help to triangulate findings based on other data
Documentary sources you have available can depend on access issues as well as succes in
locating these sources
Data collected using a survey strategy (e.g., questionnaires) that have already been analysed
for their original purpose
Collected through one of three distinct subtypes of survey strategy:
o Censuses
- Usually carried out by governments Data are often:
clearly defined
well documented
of high quality
easily accessible
widely used
- Are unique as, unlike surveys, participation is obligatory Therefore,
they provide very good coverage of the population surveyed
- E.g., population and housing censuses
o
Ad-hoc surveys (result of one survey/ doing the survey just once)
- = A general term normally used to describe the collection of data
that only occurs once due to the specificity of focus
- Usually one-off surveys
- Usually far more specific in their subject matter
- Because of their ad hoc nature, it will probably be more difficult to
discover relevant surveys
= Secondary data created by combining two or more different data sets prior to the data
being accessed for the research. These data sets can be based entirely on documentary or
on survey data, or can be an amalgam of the two
E.g., Various compilations of company information
o E.g., Europes 15,000 Largest Companies
Some methods of compilation
o Extract and combine selected comparable variables from a number of surveys or
from the same survey that has been repeated a number of times to provide
longitudinal data (time-series data)
o Data compiled from the same cases over time using a series of snapshots to form
cohort studies
o Secondary data from different sources can be combined, if they have the same
geographical basis, to form area-based data sets (E.g., Europe in Figures: Eurostat
Yearbook)
Question
The Facebook-page of McDonalds is an example of:
a) Documentary secondary data
b) Survey secondary data
c) Multiple source secondary data
d) None of the above types of secondary data
Answer: A) Documentary
HOW TO LOCATE SECONDARY DATA?
Are the data you need available?
Requires you to:
STEP 1: Establish whether the sort of data you require are likely to be available as secondary data
STEP 2: Locate the precise data you require
STEP 1: ESTABLISHING THE LIKELY AVAILABILITY IF SECONDARY DATA
Literature review (Reference list)
Quality national newspapers
Subject-specific textbooks
Tertiary literature (e.g., indexes and catalogues)
Informal discussions
STEP 2: LOCATE SECONDARY DATA
Once you have ascertained that secondary data are likely to exist, you need to find their
precise location
o Relatively straightfoward for secondary data held in online databases or held by
specialist libraries
o Data held by organisations are more difficult to locate (time consuming, quality?, )
o Once you have located a possible secondary data set, you need to be certain that it
will meet your needs
1. OVERALL SUITABILITY
Measurement validity
o Do the measures used match those you need?
o E.g., A manufacturing organisation recording monthly sales whereas you are
interested in monthly orders
o E.g., Use minutes of company meetings as a proxy for what actually happened in
those meetings
Coverage and unmeasured variables
o Do secondary data cover the population about which you need data, for the time
period you need, and contain variables that will enable you to answer the research
question?
o Some secondary data sets may not include variables you have identified as
necessary for your analysis (i.e., unmeasured variables)
Checklist:
Does the data set contain the information you require to answer your research
question(s)?
Do the measures used match those you require?
Is the data set a proxy for the data you really need?
Does the data set cover the population that is the subject of your research?
Does the data set cover the geographical area that is the subject of your research?
Can data about the population that is the subject of your research be separated from
unwanted data?
Are the data for the right time period or sufficiently up to date?
Are data available for all the variables you require to answer your research question(s)?
Are the variables defined clearly?
2. PRECISE SUITABILITY
Reliability and validity
o Quick option: Assess the authority or reputation of the source
o In-depth assessment:
Who is responsible for the data?
Method used to collect the data? (we can contact the person who
conducted the research and ask about the methodology)
Context in which the data were collected?
How were data analysed and reported?
Measurement bias: Can occur for two reasons
1) Deliberate or intentional distortion of data
E.g.,
Purpose of study is to reach a predetermined conclusion
E.g.,
People responding to a structured interview adjusting their
responses to please the interviewer
Triangulation!
2) Changes in the way data were collected
Particularly important for longitudinal data sets!
Checklist:
How reliable is the data set you are thinking of using?
How credible is the data source?
Is it clear what the source of the data is?
Do the credentials of the source of the data (author, institution or organisation
sponsoring the data) suggest it is likely to be reliable?
Question:
The sellers of a multimedia store visit competitive stores and write down their prices. This
observation is:
a) Natural - Personal - Not hidden
b) Manipulated - Mechanical - Not hidden
c) Manipulated - Personal Hidden
d) Natural - Personal - Hidden
Answer: d) natural personal hidden
PARTICIPANT OBSERVATION
Qualitative
Emphasis is on discovering the meaning that people attach to their actions.
Observation in which the researcher attempts to participate fully in or closely observe the
lives and activities of the research subjects and thus becomes a member of the subjects
group(s), organisation(s) or community
This enables researchers to share their experiences by not merely observing what is
happening but also feeling it
E.g., Street Corner Society by W.F. Whyte
The time you have to conduct the research will determine the role you would take as a researcher.
Complete participant:
Preventing social desirability
Raises questions of ethics
Might lose sight of research purpose
Observer as participant:
Able to focus on the researcher role
Lose the emotional involvement
Participant as observer:
Not always easy to gain trust of the group you observe
FACTORS THAT WILL DETERMINE THE CHOICE OF PARTICIPANT OBSERVER ROLE
Purpose of your research
Which role suits your research question?
E.g.: A phenomenon about which the research informants would be naturally
defensive is one that lends itself to the complete participant role
Organisational access
Ethical considerations
The degree to which you reveal your identity as the researcher or adopt a covert
stance will be dictated by ethical considerations
Data Analysis
Data from participant observation are analysed like other qualitative data (not part of this
course; see BBA3)
Data will start to be analysed at the time you collect them (i.e., data collection and data
analysis will be carried out simultaneously)
o Promising lines of enquiry that you wish to follow up in your continued observation
will emerge (cfr. analytic induction)
STRUCTURED OBSERVATION
Quantitative
Emphasis is on the frequency of actions
One of the key decisions you need to make before undertaking structured observation is
whether:
o to use an off-the-shelf coding schedule
o or to design your own coding schedule
Are the codes you have developed simple to understand and undemanding to apply so that
you will not need to memorise or check their meanings?
An alternative to the use of an off-the-shelf schedule or the development of your own may be a
combination of the two!
Data Analysis
The complexity of your analysis will depend on your research question
o It may be that you are using the coding schedule to establish the number of
interactions by category in order to relate the result to the output of the meeting.
Simple manual analysis may be sufficient for this purpose.
o Alternatively, you may be using the coding schedule to see what patterns emerge.
This level of analysis is more complex and will usually need statistical software.
Issues related to validity and reliability
See participant observation as well
Informant error: Errors that occur when informants are observed in situations that are
inconsistent with their normal behaviour patterns, leading to atypical responses
o E.g., You want to observe the amount of orders sales administrators process in a day
and choose administrators in a section that was short-staffed owing to illness
Time error: The time at which you conduct an observation provides data that are untypical
of the total time period in which you are interested
o E.g., The number of calls taken in a call centre is often higher in the hours
surrounding lunchtime in comparison to any other two-hour period
Advantages of Structure Observation
Can be used by anyone after suitable training in the use of the measuring instrument.
Therefore, you could delegate this extremely time-consuming task.
May be carried out simultaneously in different locations. This would present the opportunity
of comparison between locations.
Should yield highly reliable results by virtue of its replicability. The easier the observation
instrument is to use and understand, the more reliable the results will be.
Capable of more than simply observing the frequency of events. It is also possible to record
the relationship between events. For example, does a visit to a website lead to the
exploration of related pages and video recordings; does this lead to a decision to purchase?
Allows the collection of data at the time they occur in their natural setting. Therefore, there
is no need to depend on second-hand accounts of phenomena from participants who put
their own interpretation on events.
Secures data that most informants would ignore because to them these are too mundane or
irrelevant.
Disadvantages of Structure Observation
Unless virtual observation is used, the observer must be in the research setting when the
phenomena under study are taking place
Research results are limited to overt action or surface indicators from which the observer
must take inferences
Data are slow (and may be expensive) to collect
Question:
Which of the following statements is wrong?
A) Data based on observational studies can be used as secondary data
B) Structured observation might serve an explanatory research purpose
C) Collecting data through observation is part of qualitative research designs
D) Not revealing your researcher role in observational studies is advantageous as informants
probably behave in less socially desirable ways
Answer: C => Can also be quantitative. In exam explain why and with an example
A => Correct. But not very usual
B => Correct. We can investigate causal relationships between variables.
Next to the research purpose and the research strategy, which other factors determine when to use
qualitative research interviews?
The significance of establishing personal contact
The nature of the data collection questions
Length of time required and completeness of the process
The significance of establishing personal contact
People are more likely to agree to be interviewed rather than complete a questionnaire
An interview provides the opportunity to receive feedback and personal assurance about the
way in which information will be used
People may feel it is not appropriate to provide (sensitive and confidential) information to
someone they have never met
People may be reluctant to spend time providing written answers if the meaning of any
question is not entirely clear
More control over those who fill in the questionnaire
In sum, a checklist to help you decide whether to use qualitative research interviews
Does the purpose of your research suggests using semi-structured and/or in-depth
interviews?
Will it help to seek personal contact in terms of gaining access to participants and their data?
Are your data collection questions large in number, complex or open-ended?
Will there be a need to vary the order and logic of questioning?
Will it help to be able to probe interviewees responses to build on or seek explanation of
their answers?
Will the data collection process with each individual involve a relatively lengthy period?
Question:
Suppose you are an economist. You would like to do research among the CEOs of multinationals to
examine their opinion about opportunities and threats for the European economy during the next 10
years.
Question: What kind of interview would you undertake? Why?
Answer:
Qualitative, in order to explore more. Take a look at the target group. For a closed qualitative
questionnaire you wont have a lot of response and you wont be able to go much in depth.
Question:
Which of the following statements is wrong?
A) Structured interviews are suited for descriptive research purposes.
B) Qualitative research interviews allow to check respondents interpretation of questions.
C) Quantitative research interviews are more appropriate in case you have complex questions
compared to qualitative research interviews.
D) Unstructured interviews can be combined with quantitative data collection methods in the
same research project.
Answer: C => (Quantitative research interviews)
Its best to have qualitative because you can add more questions and expand the topic more.
Within business and management research, the greatest use of questionnaires is made
within the survey strategy. However, other research strategies (e.g., experiments) can make
use of these data collection methods as well.
Questionnaire = A general term to include all methods of data collection in which each
person is asked to respond to the same set of questions in a predetermined order
o Thus also includes, for instance, structured interviews (see Chapter 7), certain
telephone or online questionnaires,
POSTAL QUESTIONNAIRES
Populations characteristics for which suitable: Literate individuals who can be contacted by
post; selected by name, household, organisation
Confidence that right person has responded: Low
Likelihood of contamination or distortion of respondents answer: May be contaminated by
consultation with others
Size of sample: Large, can be geographically dispersed
Likely response rate: Variable, 30-50% reasonable
Feasible length of questionnaire: 6-8 A4 pages
Suitable types of question: Closed questions but not too complex; must be of interest to
respondent
Time taken to complete collection: 4-8 weeks from posting (dependent on number of followups).
Main financial resource implications: Outward and return postage, photocopying, clerical
support, data entry.
Role of the interviewer/field worker: None
Data input: Closed questions can be designed so that responses may be entered using
optical mark readers after questionnaire has been returned
Populations characteristics for which suitable: Literate individuals who can be contacted by
post; selected by name, household, organisation,
Confidence that right person has responded: Low
Likelihood of contamination or distortion of respondents answer: May be contaminated by
consultation with others
Size of sample: Dependent on number of field workers
Likely response rate: Variable, 30-50% reasonable
Feasible length of questionnaire: 6-8 A4 pages
Suitable types of question: Closed questions but not too complex; simple sequencing; must
be of interest to respondent
Time taken to complete collection: Dependent on sample size, number of field workers,
Main financial resource implications: Field workers, travel, photocopying, clerical support,
data entry
Role of the interviewer/field worker: Delivery and collection of questionnaires, enhancing
respondent participation
Data input: Closed questions can be designed so that responses may be entered using
optical mark readers after questionnaire has been returned
TELEPHONE QUESTIONNAIRES
Populations characteristics for which suitable: Individuals who can be telephoned; selected
by name, household, organisation,
Confidence that right person has responded: High
Likelihood of contamination or distortion of respondents answer: Occasionally distorted or
invented by interviewer
Size of sample: Dependent on number of interviewers
Likely response rate: High, 50-70% reasonable
Feasible length of questionnaire: Up to half an hour
Suitable types of question: Open and closed questions, including complicated questions;
complicated sequencing fine
Time taken to complete collection: Dependent on sample size, number of interviewers, but
slower than self-completed for same sample size
Main financial resource implications: Interviewers, telephone call, clerical support;
photocopying and data entry if not using CATI; programming, software, and computers if
using CATI (= computer-aided/assisted telephone interviewing)
Role of the interviewer/field worker: Enhancing respondent participation; guiding the
respondent through the questionnaire; answering respondents questions
Data input: Response to all questions entered at time of collection using CATI
STRUCTURED INTERVIEW
OPINION VARIABLES
Record how respondents feel about something or what they think or believe is true or false. Asking
for someones opinion.
Example: How do you feel about the following statement? Teachers at KUL should place their
students interests before their own.
Strongly disagree
Mildly disagree
Neither agree or disagree
Mildly agree
Strongly agree
BEHAVIOUR VARIABLES
Contain data on what people (or their organisations) did in the past, do now or will do in the future
(E.g.: What were you studying last year? What toothpaste do you use?)
Example: Did you ever went to Paris?
Yes
No
ATTRIBUTE VARIABLES
Contain data about the respondents characteristics; Attributes are best thought of as things a
respondent possesses, rather than things a respondent does. They are, among others, used to
explore how opinions and behavior differ between respondents as well as to check that the data
collected are representative of the total population (see Chapter 4 Sampling)
Used to link to an opinion/behaviour variable. E.g.: Gender => Opinion about
Example: What is your gender?
Female
Male
Question:
What type of variable (opinion, behaviour or attribute) is measured by means of the following
question?
What is your marital status?
Single
Married or living in long-term relationship
Widowed
Divorced
Other (Please describe)
Answer: Attribute
Question:
What type of variable (opinion, behaviour or attribute) is measured by means of the following
question?
Do you agree or disagree with the right to tourism?
Strongly agree
Agree
Neither agree, nor disagree
Disagree
Strongly disagree
Answer: Opinion
Deciding what data need to be collected
For most business and management research, the data you collect using questionnaires will
be used for either:
Descriptive purposes
o It is important that you select the appropriate characteristics to answer your
research question. Therefore, you will need to have:
- Reviewed the literature carefully
- Discussed your ideas with colleagues, project tutor and other
interested parties
Explanatory purposes
o You need to be clear about which relationships you think are likely to exist
between variables:
o Dependent variable(s)? Independent variable(s)? Mediating variable(s)?
Moderating variable(s)?
Independent variable (1): Variable that causes changes to a dependent variable or variables
Dependent variable (3): Variable that changes in response to changes in other variables
Mediating variable (2): A variable that transmits the effect of an independent variable to a
dependent variable
Moderating variable (4): A variable that affects the relationship between an independent
variable and a dependent variable
Step 5: Establish the level of detail required from the data for each variable.
Details are important as they influence the statistical technique of the research.
Investigative question
= Do customers feel that they should have an outside smoking area at restaurants
and bars as a right? (Opinion)
Variable required
= Opinion of customers on restaurants and bars providing an outside smoking area
as a right
Detail in which data measured
= Feelshould be a right, should not be a right, no strong feelings (NB will need s
separate questions for restaurants and for bars)
Investigative question
= Do customers opinion differ depending on age? (Attribute)
Variable required
= Opinion of customers on restaurants and bars providing an outside smoking area
as a right
= Age of employee
Detail in which data measured
= Feelshould be a right, should not be a right, no strong feelings
= To nearest 5-year band (youngest 16, oldest 65+)
Investigative question
= Do customers opinion differ depending on whether or not they smoke?
(Behaviour)
Variable required
= Opinion of customers on restaurants and bars providing an outside smoking area
as a right
= Smoking behaviour
Detail in which data measured
= Feelshould be a right, should not be a right, no strong feelings
= Non-smoker, smokes but not in own home, smokes in own home
Step 6: Develop measurement questions to capture the data at the level of data required for each
variable.
Detail in which data measured
= Feelshould be a right, should not be a right, no strong feelings (NB will need
separate questions for restaurants and for bars)
Measurement question:
Do you feel that smoking in restaurants:
Should be a right
Should not be a right
No strong feelings
Question:
Develop a data requirements table for the following research objective:
To establish KUL-students opinion about vegetarian meals in student restaurants.
Data requirements table
So every question in your questionnaire is essential (i.e., each question should be relevant for your
research)
There are, however, some exceptions:
o First question
o Cushion questions (threatening questions) e.g.: asking people about sensitive subjects like
salary. Start by asking general questions about the topic.
o Control questions (A question to make sure that the interviewer is really reading the
questions. E.g.: Do not tick any box. If you have people who ticked the box, you can exclude
these people from the research. This should be reported.
- Trace inconsistency
- Interviewer fraud
o Filler items (reveal the true nature) Questions completely unrelated to the research. In doing
this the participant is not aware of the topic of the research.
Ok, you have an idea about the data you should collect by means of your questionnaire. In other
words, you have an idea about the data you should collect to, in the end, be able to answer your
research question(s).
=> Now, you should consider how to design those individual questions
Designing individual questions
The reliability and validity of the data you collect depend, to a large extent, on the design of
your questions
A valid questionnaire will enable accurate data that actually measure the concepts you are
interested in to be collected, whilst one that is reliable will mean that these data are
collected consistently.
The design of each question should be determined by the data you need to collect (see slide
about data requirements table)
When designing individual question, researchers do one of three things:
- Adopt questions used in other questionnaires
- Adapt questions used in other questionnaires
- Develop their own questions
Adopting and adapting questions used in other questionnaires
May be necessary if you wish to replicate or to compare your findings with another
study (reliability can be assessed)
More efficient than developing your own questions
Questions might be validated already
When making individual questions, make neutrals questions so that participants are not guided in a
certain direction.
Question types
The population or the background of the people can influence the type of question.
Open or open-ended questions allow respondents to give answers in their own way
Closed or closed-ended or forced-choice questions provide a number of alternative answers
from which the respondent is instructed to choose
Open questions: An example
Please list up three things that you like about KUL:
1
2
3
OPEN QUESTIONS
Are widely used in unstructured and semi-structured interviews (see Chapter 7)
In questionnaires they are useful:
o If you are unsure of the response (e.g., exploratory research)
o When you require a detailed answer
o When you want to find out what is uppermost in the respondents mind
The precise wording of the question and the amount of space partially determine the length
and fullness of the response
When questionnaires are returned by large numbers of respondents, responses to open
questions are extremely time-consuming to code (keep the use of open question to a
minimum)
CLOSED QUESTIONS
Compared to open questions:
o Usually quicker and easier to answer as they require minimal writing
o Responses are easier to compare as they have been predetermined (If these
responses cannot be easily interpreted, then these benefits are marginal)
We discuss six types of closed question:
1. List questions
4. Rating questions
2. Category questions
5. Quantity questions
3. Ranking questions
6. Matrix questions
1. List Questions
The respondent is offered a list of items, of which they can choose one or more items
Example: Which benefits do you receive in your job next to your salary? Please tick the
appropriate box(es).
Mobile phone
13th month
Laptop
Car
Free internet at home
Food cheques
Public transport
Other (Please say:..)
Dont know
Useful when you need to be sure that the respondent has considered all possible responses
(However, the list of responses must be defined clearly and meaningfully to the respondent)
For structured interviews, it is often helpful to present the respondent with a prompt card
listing all responses
Extra items can be added
o Does not apply
o Dont know - Not sure
o Other
What in case of unmarked response?
o Are often inferred as negative responses
o Non-response could also indicate uncertainy or, for some questions, that an item
does not apply
2. Category Questions
Where only one response can be selected from a given set of categories
Example: Examine the pictures portraying several male celebrities below? Which celebrity do
you think is the most attractive?
Brad Pitt
Orlando Bloom
Chris Martin
Are particularly useful if you need to collect data about behavior or attributes
The number of categories that you can include without affecting the accuracy of responses is
dependent on the type of questionnaire (self-completed and telephone questionnaires <
structured interviews using a prompt card)
Arrange responses in a logical order so that it is easy to locate the response category that
corresponds to respondents answer
Categories should be mutually exclusive and should cover all possible responses
3. Ranking Questions
Where the respondent is asked to place something in rank order (you can discover the
relative importance to the respondent)
Example: Please number each of the factors listed below in order of importance to you in
your choice of a new car. Number the most important 1, the next 2 and so on.
Factor
Importance
Carbon dioxide emissions
Boot size
Depreciation
Safety features
Fuel economy
Price
Driving enjoyment
You need to make sure that the instructions are clear and will be understood by the
respondent
In general, ranking more than seven items takes too much effort and reduces motivation to
complete the questionnaire
Respondents can rank accurately only when they can see or remember all items.
o This can be overcome with face-to-face questionnaires by using prompt card.
o Telephone questionnaires should ask respondents to rank fewer items, as the
respondent will need to rely on their memory
Can be combined with a list question: List-ranking question
Example: Please number each of the factors listed below in order of importance to you in
your choice of a new car. Number the most important 1, the next 2 and so on. If a factor has
no importance at all, please leave blank.
Factor
Carbon dioxide emissions
Boot size
Depreciation
Safety features
Fuel economy
Price
Driving enjoyment
Importance
4. Rating Questions
In which a rating device is used to record responses
Often used to collect opinion data
Three types of rating question that are often used:
o Likert-style rating question
o Numeric rating question
o Semantic differential rating question
Likert-style rating question
Allows the respondent to indicate how strongly she or he agrees or disagrees with a
statement
Often used to examine attitudes, importance, intentions
Usually on a four-, five-, six- or seven-point rating scale
Points are accompanied by a verbal description (sometimes also a number)
Even number of points forces the respondent to choose, an odd number of points allows to
choose the middle not sure category
6. Matrix Questions
Grid questions; Enable you to record the responses to two or more (similar) questions at the
same time using the same grid
Saves space
However, it is suggested that respondents may have difficulties comprehending these
designs and that they are a barrier to response Clear instructions might be a solution
Thermometer Scale
Please indicate how much you like McDonalds hamburgers by coloring in the thermometer. Start at
the bottom and color up to the temperature level that best indicates how strong your preference is.
Question:
What type of question is the question below?
Please describe what you think is the main reason why students study at KUL
Answer: Open Question
Question:
What type of question is the question below?
What is your age?
Less than 20 years
21-40 years
41-60 years
More than 60 years
Answer: Category question (you can only select one)
Question:
What type of question is the question below?
How many computers do you have at home?
____ computers
Answer: Self-coded question
DESIGNING INDIVIDUAL QUESTIONS
When designing your individual questions, you should consider several important
questions!
o Will respondents have the necessary knowledge to answer your question? Does the
respondent has the requested information?
o Does your question appear to talk down to respondents? Are there any words in
your question that might cause offence? Is your question likely to embarrass the
respondent?
o Are the words used in your question familiar to the respondents? Will all
respondents understand the questions in the same way? Do you use vague words
that are open to all sorts of interpretation (e.g., sometimes)?
o Are there any words that sound familiar and might be confused with those used in
your question? Are there any words that look similar and might be confused if your
question is read quickly?
o Can your question be shortened?
o Are you asking more than one question at the same time?
o Does your question include a negative or double negative?
o Is your question unambiguous?
o Does your question imply that a certain answer is correct? (leading questions)
Central tendency effect
During last week, how many hours have you watched television per day (on average)?
o Less than 1 hour
o Between 1 hour and less than 2 hours
o Between 2 hours and less than 3 hours
o Between 3 hours and less than 4 hours
o 4 hours or more
People tend to choose the average answer. Social desirability problem. (Validity)
Are the instructions on how to answer and interpret each question clear?
In case of a structured interview, include instructions for the interviewer!
(e.g., Read aloud all possible responses)
Introduce new topics (e.g., The following questions refer to)
Does the respondent has to put lots of effort in providing the information?
Example: Van Kenhove (1989) 152 students: How many cups of coffee did you had on
average each day during last week?
People tend to avoid answering a question if they have to think too much.
Does your the respondent remembers the requested information?
o The extent to which respondents remember facts from the past depends on:
- The importance of the event: Routine and involvement
- The time between the event and the question
- The extent to which respondents memory is stimulated
Question:
What is wrong with the following question?
1. Would you rather not eat non-vegetarian dishes?
2. When did you stop smoking?
3. Many students believe their advisors are not helping them with their master theses. Do you
believe this to be the case?
4. What is the income of your parents?
5. Do you have a good relationship with your mother and father?
6. Does the government should prohibit promotion of tobacco products?
Answer:
1. Non-vegetarian dishes => Too negative
2. Stop smoking => Should indicate year
3. Advisor =>leading question. Most respondents would say yes
4. Income => might not have the knowledge
5. Relationship mother and father => 2 questions in one
6. Tobacco => Leading question
Ok, you have designed your individual questions and the wording of each of these questions is
correct.
Now, you should consider how to construct the questionnaire
Sometimes it is necessary to deviate from the main principle order above. E.g.: Quota sample
Also,
Make the questions simple and not leading.
Start with simple questions followed by more difficult ones.
Least threatening to most threatening.
CONSTRUCTING THE QUESTIONNAIRE : ORDER AND FLOW
Are questions at the beginning of your questionnaire more straightforward and ones the
respondent will enjoy answering? (introduce the questionnaire with some introductory and
easy questions) Are questions at the beginning of the questionnaire obviously relevant to
the stated purpose of your questionnaire?
Are questions and topics that are more complex placed towards the middle of your
questionnaire?
Are personal and sensitive questions towards the end of your questionnaire, and is their
purpose clearly explained?
Are questions grouped into obvious sections that will make sense to the respondent?
ORDER EFFECTS
The effect of previously asked questions on the responses of subsequent questions
o Example:
Have you passed your driving test?
Did you have a good driving instructor?
Funneling = General specific questions
CONSTRUCTING THE QUESTIONNAIRE : ORDER AND FLOW
Consider the layout of the questionnaire. This is important for both interviewer- and selfcompleted questionnairs:
o Make reading questions easy
o Make filling in responses easy
o Attractive to encourage to fill in and return
o Not appear too long (see previous class / use of matrix questions)
o Keep the visual appearance simple
When constructing your questionnaire, do not forget to explain the purpose of the
questionnaire. Next to developing a covering letter or email (affects the response rate), you
should:
o Introduce the questionnaire
Why you want the respondent to complete the survey
Interviewer-administered questionnaire: Short introduction that interviewer
reads to respondent
Prepare answers to some questions the respondent might ask you
o Close the questionnaire
Thanking respondent
Providing contact details for any queries
When/How/Where to return
Ok, you have constructed the questionnaire.
Prior to administering your questionnaire to collect data, it should be tested first.
The purpose is to refine the questionnaire so that respondents will have no problems in
answering the questionnaire. In addition, it will enable you to obtain some assessment of
reliability and validity.
o Expert(s) to comment on your questionnaire
o Pilot test (trial run) among a smaller group (similar to population)
The number of people and pilot tests depends on research question, size of
research project, time and money resources and quality of initial
questionnaire design
You should find out:
o How long the questionnaire took to complete
o The clarity of instructions
o Which, if any, questions were unclear or ambiguous
o Which, if any, questions the respondent felt uneasy about answering
o Whether in their opinion there were any major topic omissions
o Whether the layout was clear and attractive
o Any other comments
In case of interviewer-completed questionnaires, you should find out whether:
o There are any questions for which visual aids should have been provided
o They have difficulty in finding their way through the questionnaire
o They are recording answers correctly
So now the data are entered into a data matrix (e.g., SPSS data set) either manually (after coding) or
electronically/automatically (e.g., by using Qualtrics). You thus have raw data to be analysed.
In general, each column of this data matrix represents a variable, while each row represents a case.
Case 1
Case 2
Case 3
ID
1
2
3
Chapter 9: Experiments
Commonly used to infer causal relationships
Simply stated, to infer whether a change in one or more independent variables produces
a change in one or more dependent variables.
Pre-test
measurement
of purchasing
behaviour
Post-test
measurement
of purchasing
behaviour
Y (dep)
Attitude towards promotion
Mood
Positive
Negative
If there is a difference between O22 and O21, this is caused by extraneous variables
The difference between O12 and O11 is also influenced by these extraneous variables, but also
by the intervention
Consequently, the effect of the independent variable:
= (O12 O11) (O22 O21)
Experimental group
Control group
(Manipulation + third variable) Third variable = Manipulation
CLASSIC EXPERIMENT
Reactive error:
The artificiality of the lab environment may cause reactive error in that participants react to the
situation itself rather than to the independent variable
Demand artefacts:
A phenomenon in which participants attempt to guess the purpose of the experiment and respond
accordingly.
For example, while viewing a film clip, participants may recall pre-treatment questions about the
brand and guess that the advertisement is trying to change their attitudes towards the brand.
Internal validity:
To draw valid conclusions about the effects of independent variables on the study group.
External validity:
To make valid generalisations
INTERNAL VALIDITY
Refers to whether the manipulation or treatment of the independent variables actually
caused the observed effects on the dependent variables. Refers to whether the observed
effects on the test units could have been caused by variables other than the treatment.
If the observed effects are influenced or confounded by extraneous variables, it is difficult to
draw valid inferences about the causal relationship between the independent and
dependent variables.
Internal validity is the basic minimum that must be present in an experiment before any
conclusion about treatment effects can be made.
Control of extraneous variables is a necessary condition for establishing internal validity.
EXTERNAL VALIDITY
Refers to whether the cause-and-effect relationships found in the experiment can be
generalised. Can the results be generalised beyond the experimental situation and, if so, to
what populations, settings, times, independent and dependent variables can the results be
projected?
Threats to external validity arise when the specific set of experimental conditions does not
realistically take into account the interactions of other relevant variables in the real world.
To control for extraneous variables, a researcher may conduct an experiment in an artificial
environment. This enhances internal validity, but may limit external validity.
Regardless of the deterrents to external validity, if an experiment lacks internal validity, it
may not be meaningful to generalise the results.
THREATS TO VALIDITY
Factors that threaten internal validity may also threaten external validity, the most serious
of these being extraneous or confounding variables as they offer alternative explanations.
Some examples of extraneous variables (which can occur jointly and also interact with each
other):
o History
o Maturation
o Testing effects
o Instrumentation
o Selection bias
o Mortality
Threats to validity: History
= Specific events that are external to the experiment but that occur at the same time as the
experiment
The longer the time interval between observations, the greater the possibility that history
will confound the results.
Example:
o You measure ticket sales for a Christmas market before and after a new promotional
campaign. You find no difference between the ticket sales.
o Was the promotional campaign ineffective? What if, for instance, general economic
conditions declined during the experiment and the local area was particularly hard
hit by redundancies through several employers closing down their operations?
Threats to validity: Maturation
= An extraneous variable attributable to changes in the test units themselves that occur with the
passage of time
In an experiment involving people, maturation takes place as people become older, more
experienced, tired, bored or uninterested. (they might also think differently)
Studies that span several months are vulnerable to maturation, since it is difficult to know
how participants are changing over time.
Maturation effects also extend to test units other than people. For instance, travel
companies changing over time in terms of personnel, physical layout, decoration,
THREATS TO VALIDITY : TESTING EFFECTS
= Effects caused by the process of experimentation
An example
An experiment measuring the effect of advertising on attitudes towards taking a holiday in Egypt.
You measure the attitudes before and after the advertisement. If there is no difference between the
pre- and post-treatment attitude, it might be that the participants tried to maintain consistency
among these two attitudes.
How to change the above design to examine the presence of such interaction effects?
Solomon Four group design:
One way to solve this, is by means of a double blind experiment: Both the test units and the
researcher conducting the experiment are not aware of the goal of the experiment
THREATS TO VALIDITY : SELECTION BIAS
= An extraneous variable attributable to the improper assignment of participant to treatment
conditions
This bias occurs, among others, when the selection or assignment of participants results in
treatment groups that differ on the dependent variable before the exposure to the
treatment condition.
Suppose that age influences health and that all older people in your study are attributed to
the placebo condition (i.e., the control condition) and not to the medicine condition.
Suppose that health is poorer in the control condition compared to the medicine condition.
Is this result due to the placebo or rather due to the age variable?
In other words, make sure that age is equally distributed among the different
experimental conditions (i.e., the control condition and the medicine condition)
If you are not allowed to or if you are not able to apply randomisation, you can ensure similarity of
both experimental and control group by means of the matching procedure
Matching
= A method of controlling extraneous variables that involves matching participants on a set of key
background variables before assigning them to the treatment conditions.
This method has some drawbacks:
1. Participants can be matched on only a few characteristics, so the participants may be
similar on the variables selected but unequal on others.
2. If the matched characteristics are irrelevant to the dependent variable, then the
matching effort has been futile.
3. Matching is only possible in case you have all relevant information for all test units
Statistical control
= A method of controlling extraneous variables by measuring the extraneous variables and
adjusting for their effects through statistical methods
E.g., ANCOVA = Analysis of Covariance
Manipulation (check)
Suppose you want to examine the influence of a negative versus a positive mood on the
attitude towards an advertisement. You run a laboratory experiment. Respondents are
confronted with one of two film clips aimed at inducing either a negative or positive mood
state (i.e., two experimental groups, no control group).
In this experiment, there must be a manipulation check to test whether respondents being
confronted with the negative mood film clip indeed have a negative mood while
respondents being confronted with the positive mood film clip indeed have a positive
mood. If not, this is a threat to the interval validity of your experiment!
Design control
= A method of controlling extraneous variables that involves using specific experimental designs
Each experimental design has its own advantages and disadvantages, also in relation to
confounding variables
However, the goal of this chapter is not to explain all the possible experimental designs
EXPERIMENTAL DESIGNS
Up til now, we talked about some experimental designs in which the influence of one
particular manipulation/intervention/independent variable with one level is examined.
However, there are also experimental designs in which the effect of more than one level
and/or more than one independent variable is examined. (Some of these designs also not
include a before-measurement)
COMPLETELY RANDOMISED DESIGN
The effect of one independent variable with different levels is examined (i.e., different
experimental groups are attributed to different experimental treatments with regard to
one variable)
Example: The effect of three different promotional campaigns (coupon, gift, extra amount of
product) on sales
o At random, you select 27 similar supermarkets that sell Snickers
o At random, these supermarkets are divided in three groups of 9 supermarkets (9
coupon, 9 gift, 9 extra amount)
o After one month, the actual sales is measured and compared between the three
experimental groups
FACTORIAL DESIGNS
The effect of more than one independent variable with each variable having more than
one level is examined
Example: The effect of two different promotional campaigns (coupon, gift) and two package
colours (red, yellow) on sales
o = 2x2 between-subjects design (i.e., 2 factors with each two levels). In other
words, there are 4 experimental conditions. Respondents are randomly assigned to
one condition.
2 campaigns, 2 colours
Types of campaign
2 levels
Sales
Colour
2 levels
BETWEEN-SUBJECTS DESIGN VS . WITHIN-SUBJECTS DESIGN
Between-subjects design: Participants belong to one and only one experimental condition.
For example
Between-subjects design
o Advantages
Different levels of the independent variable will not influence each other
o Disdavantages
Test units in the different conditions can be different in terms of relevant
variables
Within-subjects design: Each test unit is assigned to each experimental condition. For this reason,
this approach is known as repeated measures.
For example
Within-subjects design
o Advantages
May be more practical than a between-subjects design because it requires
fewer participants
No differences between the different groups in your design
o Disdavantages
May lead to carryover effects where familiarity or fatigue with the process
distorts the validity of the findings
Question:
Suzan is a marketing researcher. She is interested in examining the impact of advertisements
depicting celebrities on consumer behavior. Based on an extensive literature review, she formulates
the following hypothesis:
Intentions to buy sporting clothes are higher when endorsed by male celebrities compared to female
celebrities, while intentions to buy healthy food are higher when endorsed by female celebrities
compared to male celebrities.
a) What is the dependent variable in Suzans research?
b) Draw a bar chart that represents the hypothesis of this research. Define each element of this
line chart.
c) How many experimental conditions are included in Suzans experimental design? Explain
your answer.
d) Suppose that Suzan opts for a between-subjects design, while her colleague opts for a
within-subjects design. What is the difference between those two experiments?
e) Develop a rating question that enables Suzan to measure the dependent variable
Answer:
(Ind) Gender of celebrities
Male
Female
Type of product
Sportswear
Healthy Food
Intention to buy
4
3
2
1
0
Male celebrities
Female celebrities
Gender of celebrities
Sportswear
Healthy food
Question:
You are doing research on the influence of specific positive emotions (pride versus happiness) on the
extent to which people behave ethically. In other words, you are wondering whether people behave
ethically to a different extent when they are feeling pride compared to happy.
Design a lab experiment that would enable you to examine this research topic. How would you
measure the independent and dependent variables?
Answer: