"

Data Collection and Analysis

Measurement[1]

How do we know that our measures are good? Without some assurance of the quality of our measures, we cannot be certain that our findings have any meaning or, at the least, that our findings mean what we think they mean. When social scientists measure concepts, they aim to achieve reliability and validity in their measures. These two aspects of measurement quality are the focus of the this section. We will consider reliability first and then take a look at validity. For this section, imagine we are interested in measuring the concepts of alcoholism and alcohol intake. What are some potential problems that could arise when attempting to measure this concept, and how might we work to overcome those problems?

Reliability

First, suppose a researcher has decided to measure alcoholism by asking people to respond to the following question: Have you ever had a problem with alcohol? If we measure alcoholism in this way, it seems likely that anyone who identifies as an alcoholic would respond with a yes to the question. So this must be a good way to identify our group of interest, right? Well, maybe. Think about how you or others you know would respond to this question. Would responses differ after a wild night out from what they would have been the day before? Might a teetotaller’s current headache from the single glass of wine he had last night influence how he answers the question this morning? How would that same person respond to the question before consuming the wine? In each of these cases, if the same person would respond differently to the same question at different points, it is possible that our measure of alcoholism has a reliability problem. Reliability in measurement is about consistency. If a measure is reliable, it means that if the same measure is applied consistently to the same person, the result will be the same each time.

One common problem of reliability with social scientific measures is memory. If we ask research participants to recall some aspect of their own past behavior, we should try to make the recollection process as simple and straightforward for them as possible. Sticking with the topic of alcohol intake, if we ask respondents how much wine, beer, and liquor they have consumed each day over the course of the past three 3 months, how likely are we to get accurate responses? Unless a person keeps a journal documenting their intake, there will very likely be some inaccuracies in their responses. If, on the other hand, we ask a person how many drinks of any kind he or she has consumed in the past week, we might get a more accurate set of responses. Reliability can be an issue even when we are reliant upon individuals to accurately report their behaviors.

We can look at another.  Perhaps a field researcher is interested in observing how alcohol intake influences interactions in public locations. She may decide to conduct observations at a local pub, noting how many drinks patrons consume and how their behavior changes as their intake changes. But what if the researcher needs to use the restroom and misses the three shots of tequila that the person next to her downs during the brief period she is away? The reliability of this researcher’s measure of alcohol intake, counting numbers of drinks she observes patrons consume, depends upon her ability to actually observe every instance of patrons consuming drinks. If she is unlikely to be able to observe every such instance, then perhaps her mechanism for measuring this concept is not reliable.

Validity

While reliability is about consistency, validity is about shared understanding. What image comes to mind for you when you hear the word alcoholic? Are you certain that the image you conjure up is similar to the image others have in mind? If not, then we may be facing a problem of validity.

To be valid, we must be certain that our measures accurately get at the meaning of our concepts. Think back to the first possible measure of alcoholism we considered in the subsection “Reliability.” There, we initially considered measuring alcoholism by asking research participants the following question: Have you ever had a problem with alcohol? We realized that this might not be the most reliable way of measuring alcoholism because the same person’s response might vary dramatically depending on how he or she is feeling that day. Likewise, this measure of alcoholism is not particularly valid. What is “a problem” with alcohol? For some, it might be having had a single regrettable or embarrassing moment that resulted from consuming too much. For others, the threshold for “problem” might be different; perhaps a person has had numerous embarrassing drunken moments but still gets out of bed for work every day and he therefore does not perceive himself as having a problem. Because what each respondent considers to be problematic could vary so dramatically, our measure of alcoholism is not likely to yield any useful or meaningful results if our aim is to objectively understand, say, how many of our research participants are alcoholics.

Here is another example. Perhaps we are interested in learning about a person’s dedication to healthy living. Most of us would probably agree that engaging in regular exercise is a sign of healthy living, so we could measure healthy living by counting the number of times per week that a person visits his local gym. At first this might seem like a reasonable measure, but if this respondent’s gym is anything like some of the gyms, there exists the distinct possibility that his gym visits include activities that are decidedly not fitness related. Perhaps he visits the gym to use the tanning beds, not a particularly good indicator of healthy living, or to flirt with potential dates or sit in the sauna. These activities, while potentially relaxing, are probably not the best indicators of healthy living. Therefore, recording the number of times a person visits the gym may not be the most valid way to measure his or her dedication to healthy living. Using this measure would not really give us an indication of a person’s dedication to healthy living and therefore, we would not really be measuring what we intended to measure.

Indeed, in the social sciences it is often not as straightforward as A causes B in the classic experiments.  Frequently there are many other variables that may occur at the same time as A and/or B cause both A and B. Therefore, a researcher must be careful to ensure that his or her study has internal validity …that it does, in fact, test the very thing it seeks to test. There are several threats to internal validity (e.g. history, maturation, testing, regression to the mean, selection biases, and instrumentation) and ways to control for these types of threats (e.g. experiment and the use of a control or comparison groups.

Researchers usually also want external validity, meaning that they want their study to be generalizable to other situations and contexts, beyond the current project. They also want it to reflect real world environments where the phenomena occur and to prove that it was not due to chance that they got the findings they did. As Palys and Atchison (2014) state, it does not, necessarily, have anything to do with the representativeness of the sample.  Rather, it depends upon the nature of the phenomenon under study and on the research objectives.

At its core, validity is about social agreement. One quick and easy way to help ensure that your measures are valid is to discuss them with others. One way to think of validity is to think of it as you would a portrait. Some portraits of people look just like the actual person they are intended to represent. But other representations of people’s images, such as caricatures and stick drawings, are not nearly as accurate. While a portrait may not be an exact representation of how a person looks, what’s important is the extent to which it approximates the look of the person it is intended to represent. The same goes for validity in measures. No measure is exact, but some measures are more accurate than others.

Complexities in Measurement

You should now have an idea about how to assess the quality of your measures. But measurement is a complex process, and some concepts are more complex than others. Measuring a person’s political party affiliation, for example, is less complex than measuring her or his sense of alienation. In this section we will consider some of these complexities in measurement. First, we will examine the various levels of measurement that exist, and then we will consider a couple of strategies for capturing the complexities of the concepts we wish to measure.

LEVELS OF MEASUREMENT

When social scientists measure concepts, they sometimes use the language of variables and attributes. A variable refers to a grouping of several characteristics. Attributes are those characteristics. A variable’s attributes determine its level of measurement. There are four possible levels of measurement; they are nominal, ordinal, interval, and ratio.

NOMINAL MEASUREMENT

At the nominal level of measurement, variable attributes meet the criteria of exhaustiveness and mutual exclusivity. This is the most basic level of measurement. Relationship status, gender, race, political party affiliation, and religious affiliation are all examples of nominal-level variables. For example, to measure relationship status, we might ask respondents to tell us if they are currently partnered or single. These two attributes pretty much exhaust the possibilities for relationship status (i.e., everyone is always one or the other of these), and it is not possible for a person to simultaneous occupy more than one of these statuses (e.g., if you are single, you cannot also be partnered).  Therefore, this measure of relationship status meets the criteria that nominal-level attributes must be exhaustive and mutually exclusive. One unique feature of nominal-level measures is that they cannot be mathematically quantified. We cannot say, for example, that being partnered has more or less quantifiable value than being single (note we are not talking here about the economic impact of one’s relationship status— we are talking only about relationship status on its own, not in relation to other variables).

ORDINAL MEASUREMENT

Unlike nominal-level measures, attributes at the ordinal level can be rank ordered, though we cannot calculate a mathematical distance between those attributes. We can simply say that one attribute of an ordinal-level variable is more or less than another attribute. Ordinal-level attributes are also exhaustive and mutually exclusive, as with nominal-level variables. Examples of ordinal-level measures include social class, degree of support for policy initiatives, television program rankings, and prejudice. Thus, while we can say that one person’s support for some public policy may be more or less than his neighbor’s level of support, we cannot say exactly how much more or less.

INTERVAL MEASUREMENT

At the interval level, measures meet all the criteria of the two preceding levels, plus the distance between attributes is known to be equal. IQ scores are interval level, as are temperatures. Interval-level variables are not particularly common in social science research, but their defining characteristic is that we can say how much more or less one attribute differs from another. We cannot, however, say with certainty what the ratio of one attribute is in comparison to another. For example, it would not make sense to say that 50 degrees is half as hot as 100 degrees.

RATIO MEASUREMENT

Finally, at the ratio level, attributes are mutually exclusive and exhaustive, attributes can be rank ordered, the distance between attributes is equal, and attributes have a true zero point. With these variables, we can say what the ratio of one attribute is in comparison to another. Examples of ratio-level variables include age and years of education. We know, for example, that a person who is 12 years old is twice as old as someone who is six years old.

Another point to consider when designing a research project, and which might differ slightly in qualitative and quantitative studies, has to do with units of analysis and units of observation. These two items concern what you, the researcher, actually observe in the course of your data collection and what you hope to be able to say about those observations. The table below provides a summary of the differences between units of analysis and observation.

unit of analysis is the entity that you wish to be able to say something about at the end of your study, probably what you would consider to be the main focus of your study. A unit of observation is the item (or items) that you actually observe, measure, or collect in the course of trying to learn something about your unit of analysis. In a given study, the unit of observation might be the same as the unit of analysis, but that is not always the case. Further, units of analysis are not required to be the same as units of observation. What is required, however, is for researchers to be clear about how they define their units of analysis and observation, both to themselves and to their audiences. More specifically, your unit of analysis will be determined by your research question. Your unit of observation, on the other hand, is determined largely by the method of data collection that you use to answer that research question.

To demonstrate these differences, let us look at the topic of students’ addictions to their cell phones. We will consider first how different kinds of research questions about this topic will yield different units of analysis. Then we will think about how those questions might be answered and with what kinds of data. This leads us to a variety of units of observation.

If I were to ask, “Which students are most likely to be addicted to their cell phones?” our unit of analysis would be the individual. We might mail a survey to students on a university or college campus, with the aim to classify individuals according to their membership in certain social classes and, in turn, to see how membership in those classes correlate with addiction to cell phones. For example, we might find that students studying media, males, and students with high socioeconomic status are all more likely than other students to become addicted to their cell phones. Alternatively, we could ask, “How do students’ cell phone addictions differ and how are they similar?  In this case, we could conduct observations of addicted students and record when, where, why, and how they use their cell phones.  In both cases, one using a survey and the other using observations, data are collected from individual students. Thus, the unit of observation in both examples is the individual. But the units of analysis differ in the two studies.  In the first one, our aim is to describe the characteristics of individuals.  We may then make generalizations about the populations to which these individuals belong, but our unit of analysis is still the individual.  In the second study, we will observe individuals in order to describe some social phenomenon, in this case, types of cell phone addictions. Consequently, our unit of analysis would be the social phenomenon.

Another common unit of analysis in sociological inquiry is groups. Groups of course vary in size, and almost no group is too small or too large to be of interest to sociologists. Families, friendship groups, and street gangs make up some of the more common microlevel groups examined by social scientists. Employees in an organization, professionals in a particular domain (e.g., chefs, lawyers, sociologists), and members of clubs (e.g., Girl Scouts, Rotary, Red Hat Society) are all meso-level groups that social scientists might study. Finally, at the macro level, social scientists sometimes examine citizens of entire nations or residents of different continents or other regions.

A study of student addictions to their cell phones at the group level might consider whether certain types of social clubs have more or fewer cell phone-addicted members than other sorts of clubs. Perhaps we would find that clubs that emphasize physical fitness, such as the rugby club and the scuba club, have fewer cell phone-addicted members than clubs that emphasize cerebral activity, such as the chess club and the sociology club. Our unit of analysis in this example is groups. If we had instead asked whether people who join cerebral clubs are more likely to be cell phone-addicted than those who join social clubs, then our unit of analysis would have been individuals. In either case, however, our unit of observation would be individuals.

Organizations are yet another potential unit of analysis that social scientists might wish to say something about. Organizations include entities like corporations, colleges and universities, and even night clubs. At the organization level, a study of students’ cell phone addictions might ask, “How do different colleges address the problem of cell phone addiction?” In this case, our interest lies not in the experience of individual students but instead in the campus-to-campus differences in confronting cell phone addictions. A researcher conducting a study of this type might examine schools’ written policies and procedures, so his unit of observation would be documents. However, because he ultimately wishes to describe differences across campuses, the college would be his unit of analysis.

Social phenomena are also a potential unit of analysis. Many social scientists study a variety of social interactions and social problems that fall under this category. Examples include social problems like murder or rape; interactions such as counseling sessions, Facebook chatting, or wrestling; and other social phenomena such as voting and even cell phone use or misuse. A researcher interested in students’ cell phone addictions could ask, “What are the various types of cell phone addictions that exist among students?” Perhaps the researcher will discover that some addictions are primarily centered around social media such as chat rooms, Facebook, or texting while other addictions center single-player games that discourage interaction with others. The resultant typology of cell phone addictions would tell us something about the social phenomenon (unit of analysis) being studied. As in several of the preceding examples, however, the unit of observation would likely be individual people.

Finally, a number of social scientists examine policies and principles, the last type of unit of analysis we will consider here. Studies that analyze policies and principles typically rely on documents as the unit of observation. Perhaps a researcher has been hired by a college to help it write an effective policy against cell phone use in the classroom. In this case, the researcher might gather all previously written policies from campuses all over the country and compare policies at campuses where the use of cell phones in classroom is low to policies at campuses where the use of cell phones in the classroom is high.

In sum, there are many potential units of analysis that a sociologist might examine, but some of the most common units include the following:

  1. Individuals
  2. Groups
  3. Organizations
  4. Social phenomena
  5. Policies and principles
Units of analysis and units of observation: A hypothetical study of students’ addictions to cell phones
Research question Unit of analysis Data collection Unit of observation Statement of findings
Which students are most likely to be addicted to their cell phones? Individuals Survey of students on campus Individuals Media majors, men, and students with high socioeconomic status are all more likely than other students to become addicted to their cell phones.
Do certain types of social clubs have more cell phone -addicted members than other sorts of clubs? Groups Survey of students on campus Individuals Clubs with a scholarly focus have more cell phone-addicted members than more socially focused clubs.
How do different colleges address the problem of addiction to cell phones? Organizations Content analysis of policies Documents Campuses without policies prohibiting cell phone use in the classroom have high levels of cell phone addiction.
What are the various types of cell phone addictions that exist among students? Social phenomena Observations of students Individuals There are two main types of cell phone addictions: social and antisocial
What are the most effective policies against cell phone addiction? Policies and principles Content analysis of policies and student records Documents Policies that require students with cell phone addictions to attend group counselling for a minimum of one semester have been found to treat addictions more effectively than those that call for expulsion of addicted students.

Want to Learn More About Data Collection?

Want to learn more about data collection and analysis?  Once you have determined your variables and units of measurement, you can begin to think about how you will collect and analyze your data.  Check out these helpful resources:

 


  1. Adapted from Sheppard, V. (2020). An introduction to research methods in sociology: Measurement and units of analysis. BCcampus. Retrieved from https://pressbooks.bccampus.ca/researchmethods/part/measurement-and-units-of-analysis/. Licensed under CC BY-NC-SA 4.0.

License

PSY-250 Research Paper Guidelines and Resources Copyright © by David Adams. All Rights Reserved.