Academia.eduAcademia.edu

Community Size and Network Closure

2010

Abstract

Community size is often found to be negatively correlated with prosocial behaviors such as formal volunteering, working on public projects and informal help to friends and strangers (Putnam 2000, p. 119, 206). This may be because people who reside in large communities simply spend less time socializing with each other. As a result, people living in large cities have on average fewer friends, and hence their social networks support less cooperation. 1 A complementary channel, which has received less attention in the literature, is that community size may affect outcomes by changing other aspects of the network structure. Specifically, even holding fixed the number of friends, we expect social networks in small communities to exhibit greater network closure, i.e., be more interconnected. The intuition is straightforward: in small communities, the pool of potential friends is limited, which increases the extent to which the network neighborhoods of two friends are likely to overlap. suggested that variation in network closure can directly affect outcomes. In particular, he argued that networks with higher closure generate high trust between friends, which facilitates cooperation and thus improves welfare. The logic is that networks with high closure allow for greater social sanctions between individuals through common friends, thus increasing incentives for cooperation. In this paper, we empirically evaluate the effect of community size on network closure in one specific social environment: schools. We make use of the National Longitudinal Study of Adolescent Health (AddHealth) which has detailed social network information for about 86,000 students in 142 US middle and high schools. Our main finding is a strong negative

Community Size and Network Closure Hunt Allcott Dean Karlan Markus M. Möbius Adam Szeidl ∗ Tanya Rosenblat January 10, 2007 Community size is often found to be negatively correlated with prosocial behaviors such as formal volunteering, working on public projects and informal help to friends and strangers (Putnam 2000, p. 119, 206). This may be because people who reside in large communities simply spend less time socializing with each other. As a result, people living in large cities have on average fewer friends, and hence their social networks support less cooperation.1 A complementary channel, which has received less attention in the literature, is that community size may affect outcomes by changing other aspects of the network structure. Specifically, even holding fixed the number of friends, we expect social networks in small communities to exhibit greater network closure, i.e., be more interconnected. The intuition is straightforward: in small communities, the pool of potential friends is limited, which increases the extent to which the network neighborhoods of two friends are likely to overlap. Coleman (1990) suggested that variation in network closure can directly affect outcomes. In particular, he argued that networks with higher closure generate high trust between friends, which facilitates cooperation and thus improves welfare. The logic is that networks with high closure allow for greater social sanctions between individuals through common friends, thus increasing incentives for cooperation. In this paper, we empirically evaluate the effect of community size on network closure in one specific social environment: schools. We make use of the National Longitudinal Study of Adolescent Health (AddHealth) which has detailed social network information for about 86,000 students in 142 US middle and high schools. Our main finding is a strong negative ∗ Harvard University and National Bureau of Economic Research, 1050 Massachusetts Avenue, Cambridge, MA 02138, USA, hunt [email protected]; Department of Economics, Yale University, New Haven, CT 06520, USA, [email protected]; Department of Economics, Harvard University and National Bureau of Economic Research, Cambridge, MA 02138, USA, [email protected]; Department of Economics, Wesleyan University, Middletown, CT 06459, USA, [email protected]; and Department of Economics, UC-Berkeley, Berkeley, CA 94720, USA, [email protected]. We thank Avinash Dixit for comments. 1 For example, Jacobs (1961) criticized city planning of the 1950s and 1960s with its emphasis on large blocks because it prevented many types of social interactions common in small cities, such as convening on sidewalks and small squares. 1 relationship between grade size and network closure. We also find that grade size is negatively associated with prosocial behavior.2 We then explore whether this community size effect on outcomes can be partially attributed to the network closure channel. I Theory and Estimation Framework Community Size and Network Structure. Community size can affect network structure by influencing the evolution of friendship links between agents. To see the logic, imagine that the social network is generated by a random process where (a) agents’ preferences for socializing with other agents are i.i.d. draws from a known distribution and (b) agents are more likely to interact with those who are around them (see e.g., Jackson and Rogers (2006)). With such a process, the degree distribution measuring the number of friends per agent will be independent of community size. However, the friends of an agent in a small community will likely be friends with each other, because there are few other potential friends. This generates overlapping neighborhoods and high network closure. In contrast, in a large community the friends of an agent are likely to have different friends, leading to non-overlapping neighborhoods and low closure. This model predicts that the social networks in small communities should exhibit higher network closure. A Measure of Network Closure. To empirically assess the magnitude of the above channel, we will use the measure of network closure developed by Mobius and Szeidl (2006). We now explain and provide intuition for this measure. First we introduce the idea of a trust flow, which measures the degree to which agents in a social network can cooperate. Define the trust flow between two agents s and t to be the number of disjoint paths in the network connecting them, where all edges in a path must be within distance K from agent s.3 Here K governs the size of the “circle of trust,” i.e., the set of agents who can be used to enforce cooperation. Intuitively, high trust flow means that s and t have better incentives to cooperate because they have access to greater social sanctions over each other should one of them defect. To measure network closure for a given circle of trust K, consider an agent s, and compute the trust flow between s and all other agents in the network. Let the sum of these pairwise 2 In the education literature, there is substantial agreement that smaller schools are associated with better outcomes (Cotton 2001). Possible mechanisms beyond the closure channel include teacher attitudes (Lee and Loeb 2000), improved safety (Wasley, Fine, Gladden, Holland, King, Mosak and Powell 2000), decisionmaking autonomy (ibid), and more parent involvement (Mitchell 2000). In economics, Hoxby (2000) and Angrist and Lavy (1999) show the connection between class size and student achievement. 3 The distance between an agent and an edge is the average of the distances between the agent and the two endpoints of the edge. 2 flows be the total trust of agent s, which measures how much cooperation can s expect from the network. For any positive integer m, compute the share of this total trust that comes from pairwise trust flows exceeding m. The resulting number which we refer to as (m, K)-closure of agent s, is a measure of the closure of the network around agent s. Loosely speaking, high (m, K)-closure can be interpreted as s having high pairwise trust with a small number of agents, while lower closure means having low pairwise trust with a large number of agents. It is important to note that closure can vary across networks even if the network size and degree distribution is held fixed. For example, figure 1 shows two networks where each of the 6 agents have two friends. However, agent s has lower closure in the network on the left. To see this formally, let K = 1.5; the trust flows between s and all other agents for this K are shown in the figure. In both networks, s enjoys a total trust of 4. However, in the left network this total trust comes from low pairwise trust flows of 1 from four other agents, while in the right network it comes from high pairwise trust flows of 2 from two other agents. Hence agent s has (1, K)-closure of 0 in the left network, and (1, K)-closure of 1 in the right network. Is high or low closure better for agent s? That depends on what he wants to use the network for. If he needs the network for valuable transactions, then high closure is better. While high closure provides access to only a few agents, s can ask valuable favors from all of these. In contrast, if s needs to engage in transactions of lower value, he prefers the greater access to people provided by the low-closure network. Estimation Framework. To evaluate the effect of community size on network closure, we will estimate equations of the form (1) closure = α + β · community size + γ · controls + ε where β measures the causal effect of community size on network closure, ε captures unobserved variation and measurement error, and the controls refer to other social and economic factors that can influence network structure. One important control variable is an agent’s number of friends, also called degree. Degree matters because it can be related to both community size and closure. More specifically, community size may affect the network structure through influencing degree: For example, if people in small communities invest more in social interactions, they end up with more friends. Alternatively, people in large communities may find a bigger potential market for friends, and as a result develop more connections. This effect of community size on degree can bias the estimation of (1) because degree is also positively correlated with closure: networks with higher average degree tend to be more 3 interconnected. To deal with this difficulty, we include flexible controls for degree in our specifications. We will also be interested in the effects of the variation in closure generated by community size on social and educational outcomes. To assess the importance of this channel, we will first estimate the total effect of community size on outcomes using the following specification: (2) outcome = δ + θ · community size + γ · controls + ν. Here the total effect of size is measured by θ. To assess how much of this effect is due to variation in network closure, we then estimate (3) outcome = δ  + θ · community size + ρ · closure + γ  · controls + ν  . If part of the effect of size on outcomes is mediated through network closure, then we expect that the absolute value of the coefficient on size falls as we include closure in the regression (|θ | < |θ|). Moreover, using the estimated coefficients in these regressions, we can measure the relative contribution of the network closure channel in the total effect of size on outcomes. This analysis, like others using school size as an independent variable, is frustrated by the lack of exogenous variation in grade size. Our estimation procedure requires that the error terms in regressions (1), (2) and (3) are orthogonal to the key size and closure variables. One possible concern with this assumption is that in equation (1) size may be correlated with closure beyond its causal effect, for example if size is related to other environmental factors that affect network structure. Similarly, in equation (3) closure may be correlated with the outcome variable through an omitted variable, like the sociability of the school’s teachers. We attempt to alleviate theses concerns by including a rich set of controls, such as social and economic factors, in all regressions. However, we cannot fully rule out alternative explanations, and hence we interpret the findings from the above analysis about closure and outcomes as suggestive evidence. II Data Description The National Longitudinal Study of Adolescent Health (AddHealth) surveyed every student in a representative sample of 142 US public and private middle and high schools in 1994 and 1995 (Udry 2003). Each of these 85,627 students reported demographics, recent grades and extracurricular activities, health outcomes and their feelings about their school environment. As part of the survey students were asked to name up to 10 friends - 5 male and 5 female. 4 We use this information to construct an “OR-social network” where a direct link between two students exists if one of them names the other one as a friend. We obtain our core sample by excluding 7,779 students who are in connected components of size smaller than 20 (7,138 form clusters of size 1). This is because network statistics become less meaningful for such small components; in particular, closure is undefined for clusters of size 1. We compute our trust flow measure for K = 2.0 for all pairs of agents in the core sample, as well as the (m, K)-closure measure for m = 2 developed above.4 We also compute network degree, the number of neighbors for each agent. We measure community size by the number of students in the student’s grade at her school.5 We use categorical indicators to control for four observables: the student’s grade (6 to 12), her ethnicity (white or non-white), her father’s education (less than high-school, high-school, college) and her school’s location (urban, sub-urban or rural). We also include controls for fractionalization in some regressions. For grade, ethnicity, and education, fractionalization is the probability that two people chosen randomly from the cluster will be of different categories. We focus on three outcome variables: TROUBLE is the student’s response to the question, “Since school started this year, how often have you had trouble getting along with other students?” with 0 being “never” and 4 being “everyday.” FEELSAFE is the student’s response to the question, “How strongly do you agree or disagree with the following statement: ‘I feel safe in my school’,” which we have coded from 1 (“strongly disagree”) to 5 (“strongly agree”). GPA is the average of the student’s most recent grades in english, math, history, and science. III Results Community Size and Network Closure. We begin with a graphical analysis. Figure 2 plots our measure of network closure for schools with grade sizes above and below the median size, holding fixed a student’s number of friends (degree). The figure provides preliminary evidence for our main hypothesis by showing that in our data, large schools tend to have lower network closure. It is important to note that this correlation between size and closure holds even when we control for number of friends. While degree and closure are correlated with each other, grade size generates variation in closure beyond its effect on the degree 4 Using m = 1 does not affect our qualitative results, nor does using K = 2.5. In most American high schools, students attend multiple classes with different groups of students, so we expect their social networks to be influenced by school size rather than by classroom size. Our focus on school size distinguishes the current paper from the body of econometric work on class size. 5 5 distribution. These observations are also reflected in the regression analysis in Table 1, where we estimate (1) in different specifications. Increasing the number of students in a grade is correlated with lower closure both in the simple ordinary least squares (OLS) regressions as well as in specifications where we include fixed effects for the number of friends of a student. Including a rich set of controls such as grade composition within the connected component, school location, parents’ education, racial composition as well as demographic fractionalization does not change our results. The fixed effect estimates suggest that a one standard deviation increase in grade size is associated with a decrease in closure by about 0.12 of a standard deviation. Closure and Outcomes. Table 2 presents estimates of equations (2) and (3) for our three outcome variables (FEELSAFE, TROUBLE and GPA). In all specifications we include fixed effects for number of friends, as well as the full set of controls used in the fourth column of table 1. Columns FS1 and GPA1 show that students in larger schools feel less safe and also have significantly lower GPA. These results are consistent with the notion that grade size is negatively related to educational and social outcomes. However, somewhat surprisingly, there is a negative relationship between having trouble with other students and grade size (column TR1). A one-standard deviation increase in grade size is associated with about a 0.04 standard deviation decrease in the FEELSAFE measure, and a 0.06 standard deviation decrease for GPA. To assess the extent to which the effect of grade size on outcomes is mediated through network closure, we then include closure in the right hand side of our regressions. As columns FS2, TR2 and GPA2 show, closure is positively correlated with good outcomes: the coefficient of closure is strongly significant and has the right sign in all three specifications. Moreover, focusing on the FEELSAFE and GPA outcomes where grade size predicts worse outcomes, we also find that including closure significantly reduces the grade size coefficient. The drop in the effect of grade size is between one fourth and one third for these two outcomes. These results suggest that a substantial part of the effect of size on FEELSAFE and GPA is mediated through the closure channel. We interpret our outcome estimates as suggestive evidence supporting Coleman’s (1990) claim that higher closure generates prosocial behavior. Mobius and Szeidl (2006) predicts such a positive relationship if agents engage in high-value transactions within the network. Coming to the help of fellow students or helping friends who lag behind academically might qualify as such “high-value” favors. 6 IV Conclusion In this paper we argue that community size can affect outcomes by influencing the interconnectedness of social networks. Using the AddHealth dataset, we have shown that there is a strong association between grade size and network closure, which is robust to controlling for the degree distribution of the social network. We also provided suggestive evidence that this relationship between size and closure can influence trust-related outcomes. Unfortunately, the AddHealth dataset contains a small number of outcomes where the role of the social network is likely to be important. It would be interesting to apply our methodology to data with more outcome variables. In particular, we propose dividing the analysis by “high-value” and “low-value” services provided by the social network (such as financial support versus information exchanged with a social network). This would allow us to test whether networks with lower closure can sometimes provide greater utility to agents who have a demand for “low-value” services (as suggested by Burt (1995)). 7 References Angrist, Josh and Victor Lavy, “Using Maimonides’ Rule to Estimate the Effect of Class Size on Scholastic Achievement,” Quarterly Journal of Economics, 1999, 114, 533–575. Burt, Ronald S., Structural Holes: The Social Structure of Competition, Cambridge: Harvard University Press, 1995. Coleman, James S., Foundations of Social Theory, Cambridge: Harvard University Press, 1990. Cotton, Kathleen, “New Small Learning Communities: Findings from Recent Literature,” Working Paper, Northwest Regional Educational Laboratory 2001. Hoxby, Caroline, “The Effects of Class Size on Student Achievement: New Evidence from Population Variation,” Quarterly Journal of Economics, 2000, 115, 1239–1285. Jackson, Matthew and Brian Rogers, “Meeting Strangers and Friends of Friends: How Random are Socially Generated Networks?,” forthcoming: American Economic Review, 2006. Jacobs, Jane, The Death and Life of Great American Cities, New York: Random House, 1961. Lee, Valerie E. and Susanna Loeb, “School Size in Chicago Elementary Schools: Effects on Teachers’ Attidues and Students’ Achievement,” American Educational Research Journal, 2000, 37, 3–31. Mitchell, Stacy, “Jack and the Giant School,” The New Rules, 2000, 2, 1–10. Mobius, Markus M. and Adam Szeidl, “Trust and Social Collateral,” working paper, Harvard University and UC Berkeley September 2006. Putnam, Robert D., Bowling Alone: The Collapse and Revival of American Community, New York: Simon & Schuster, 2000. Udry, J. Richard, “The National Longitudinal Study of Adolescent Health (Add Health), Waves I & II, 19941996; Wave III, 20012002,” machine-readable data file and documentation, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 2003. Wasley, Patricia A., Michelle Fine, Matt Gladden, Nicole E. Holland, Sherry P. King, Esther Mosak, and Linda C. Powell, Small Schools: Great Strides. A Study of New Small Schools in Chicago, New York: The Bank Street College of Education, 2000. 8 Table 1: OLS and fixed effects regressions of network (2, 2)-closure on grade size. OLS FE FE-controls Fract-controls (1) (2) (3) (4) Students in Grade -0.040 -0.032 -0.026 -0.026 (0.0007)∗∗ Const. Obs. R2 (0.0004)∗∗ 0.822 0.800 (0.0005)∗∗ (0.0005)∗∗ 0.770 0.722 (0.002)∗∗ (0.001)∗∗ (0.002)∗∗ (0.011)∗∗ 77,848 0.036 77,848 0.069 77,848 0.095 77,848 0.095 Standard errors in parentheses. See the text for definition of (2, 2)-closure. Fixed effect is for number of friends of student. Unit of grade size is 100 students. In column 3 we control for student’s grade, school location (rural, suburban and urban), parents’ education and racial composition. In column 4 we additionally include the demographic fractionalization controls. **Significantly different from 0 at the 1-percent level. Table 2: Effect of grade size on outcomes for FEELSAFE (FS), TROUBLE (TR) and GPA, without and with (2, 2)-closure. FS1 FS2 TR1 TR2 GPA1 GPA2 (1) (2) (3) (4) (5) (6) (2,2)-Closure 0.315 -0.415 0.427 (0.025)∗∗ Students in Grade Const. Obs. R2 -0.029 (0.003)∗∗ 3.747 (0.033)∗∗ -0.021 -0.021 (0.003)∗∗ (0.004)∗∗ 3.512 1.697 -0.032 (0.004)∗∗ 2.005 (0.018)∗∗ -0.033 (0.002)∗∗ 2.688 -0.022 (0.002)∗∗ 2.368 (0.075)∗∗ (0.078)∗∗ (0.098)∗∗ (0.101)∗∗ (0.056)∗∗ (0.058)∗∗ 69,042 0.022 69,042 0.024 73,780 0.04 73,780 0.042 68,220 0.047 68,220 0.054 Standard errors in parentheses. All columns include fixed effects for number of friends. Unit of grade size is 100 students. All regressions include controls for student’s grade, school location (rural, suburban and urban), parents’ education, racial composition, as well as demographic fractionalization. **Significantly different from 0 at the 1-percent level. 9 0 0 1 1 0 0 1 1 2 2 s s H igh closure Low closure 0 .2 (2,2)−Closure .4 .6 .8 1 Figure 1: Low closure (left) and high closure (right) social networks with identical link distribution (K = 1.5). Agent s enjoys the same amount of average trust in both networks in low closure network she can ask 4 agents for favors worth at most 1 while in high closure network she can ask 2 agents for favors worth at most 2. 0 5 10 Number of Friends Below Median Size 15 20 Above Median Size Figure 2: Mean (2, 2)-closure by number of friends for schools with grade size below median size (solid line) and schools with grade size above median size (dashed). See the text for definition of (2, 2)-closure. 10

References (13)

  1. Angrist, Josh and Victor Lavy, "Using Maimonides' Rule to Estimate the Effect of Class Size on Scholastic Achievement," Quarterly Journal of Economics, 1999, 114, 533-575.
  2. Burt, Ronald S., Structural Holes: The Social Structure of Competition, Cambridge: Harvard University Press, 1995.
  3. Coleman, James S., Foundations of Social Theory, Cambridge: Harvard University Press, 1990.
  4. Cotton, Kathleen, "New Small Learning Communities: Findings from Recent Literature," Working Paper, Northwest Regional Educational Laboratory 2001.
  5. Hoxby, Caroline, "The Effects of Class Size on Student Achievement: New Evidence from Population Variation," Quarterly Journal of Economics, 2000, 115, 1239-1285.
  6. Jackson, Matthew and Brian Rogers, "Meeting Strangers and Friends of Friends: How Random are Socially Generated Networks?," forthcoming: American Economic Review, 2006.
  7. Jacobs, Jane, The Death and Life of Great American Cities, New York: Random House, 1961.
  8. Lee, Valerie E. and Susanna Loeb, "School Size in Chicago Elementary Schools: Effects on Teachers' Attidues and Students' Achievement," American Educational Research Journal, 2000, 37, 3-31.
  9. Mitchell, Stacy, "Jack and the Giant School," The New Rules, 2000, 2, 1-10.
  10. Mobius, Markus M. and Adam Szeidl, "Trust and Social Collateral," working paper, Harvard University and UC Berkeley September 2006.
  11. Putnam, Robert D., Bowling Alone: The Collapse and Revival of American Community, New York: Simon & Schuster, 2000.
  12. Udry, J. Richard, "The National Longitudinal Study of Adolescent Health (Add Health), Waves I & II, 19941996; Wave III, 20012002," machine-readable data file and documentation, Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 2003.
  13. Wasley, Patricia A., Michelle Fine, Matt Gladden, Nicole E. Holland, Sherry P. King, Esther Mosak, and Linda C. Powell, Small Schools: Great Strides. A Study of New Small Schools in Chicago, New York: The Bank Street College of Education, 2000.
About the author
Northwestern University, Faculty Member
Papers
144
Followers
9
View all papers from Dean Karlanarrow_forward