Academia.eduAcademia.edu

Survey Sampling

description2,105 papers
group17,327 followers
lightbulbAbout this topic
Survey sampling is the process of selecting a subset of individuals from a larger population to estimate characteristics of the whole population. It involves techniques to ensure that the sample is representative, minimizing bias and allowing for valid inferences about the population based on the sample data.
lightbulbAbout this topic
Survey sampling is the process of selecting a subset of individuals from a larger population to estimate characteristics of the whole population. It involves techniques to ensure that the sample is representative, minimizing bias and allowing for valid inferences about the population based on the sample data.

Key research themes

1. How can probability and non-probability sampling methods be optimized to ensure representativeness and reliability in survey sampling?

This research theme focuses on understanding the strengths, limitations, and methodological innovations of both probability and non-probability sampling methods, including cutting-edge designs like voluntary sampling, cutoff sampling, and respondent-driven sampling (RDS). It addresses challenges such as sampling frame limitations, nonresponse bias, and estimation accuracy to guide researchers in selecting and implementing sampling strategies that maximize representativeness and reliability in diverse research settings.

Key finding: This paper elucidates the critical distinction between sampling and census-based studies, emphasizing that sample representativeness is crucial for valid statistical inference. It identifies key challenges like sampling bias... Read more
Key finding: This study introduces a novel non-probability 'voluntary sampling' design tailored for sensitive or complex surveys. By publicly announcing the survey intent in advance, it creates strata of volunteers and non-volunteers,... Read more
Key finding: This work systematically reviews various cutoff sampling schemas—take-all, take-some, take-none strata—and their integration with model-based and design-based inference. It demonstrates that combining certainty strata with... Read more
Key finding: This paper critically examines the assumption of sampling with replacement in Respondent-Driven Sampling (RDS) estimators through simulations and proofs. It finds that, contrary to expectations, bias due to this assumption is... Read more
Key finding: The chapter differentiates qualitative sampling strategies from quantitative ones, emphasizing theoretical and purposeful sampling to select information-rich cases rather than statistical representativeness. It clarifies the... Read more

2. What advancements in integrating auxiliary and big data sources improve precision and bias reduction in survey sampling?

This research theme investigates novel statistical methodologies that leverage auxiliary information, administrative records, satellite or GPS data, and big data analytics to enhance survey sample design and estimation. Focused on model-assisted and nonparametric methods, data integration approaches, and spatial and machine learning techniques, this theme explores how these data fusion strategies improve estimator efficiency, correct biases from incomplete sampling frames or nonresponse, and enable smaller, more precise samples with reliable inference.

Key finding: This paper proposes a novel sampling method utilizing GPS and aerial/satellite imagery to construct cluster samples where traditional population enumerations are unavailable or risky. The method accounts for building density... Read more
Key finding: The study applies flexible nonparametric regression techniques—including neural networks, regression splines, and generalized additive models—to model complex relationships between survey variables and multivariate auxiliary... Read more
Key finding: This special issue synthesizes interdisciplinary research at the nexus of big data and survey science, highlighting methodologies that combine passive data streams (e.g., sensor and social media data) with survey data to... Read more
Key finding: The authors develop calibrated estimators based on Ordered Response Randomized Techniques (ORRT) to address simultaneous nonresponse and measurement errors in stratified successive sampling designs. The proposed estimators... Read more

3. How can advancements in technology and analytical techniques improve survey implementation and quality in specialized contexts?

This theme explores the application of technological innovations such as web-based sampling, machine learning, and plotless distance-based methods to enhance survey execution and data analysis. Emphasis is on web-based respondent-driven sampling for hidden populations, artificial intelligence models for predictive surveys, and alternative ecological survey designs that reduce cost and improve precision. These approaches address operational challenges, increase data reliability, and expand the scope of feasible survey contexts.

Key finding: This article provides a comprehensive framework for implementing WebRDS, demonstrating that online adaptations of RDS can achieve effective recruitment and data collection among hard-to-reach populations. It discusses the... Read more
Key finding: This study applies neural networks and AI algorithms, specifically Group Method of Data Handling (GMDH), to improve the forecasting accuracy of coking coal quality parameters in mining operations. Through historical data from... Read more
Key finding: This empirical assessment compares the ordered distance method (ODM) and point-centred quarter method (PCQM) for tree density and basal area estimation in forest stands with varied spatial patterns. Results show PCQM provides... Read more

All papers in Survey Sampling

Some social surveys address sensitive topics for which respondents do not report reliable responses. Randomized response techniques (RRTs) are employed to increase privacy levels and provide honest answers. However, estimates obtained... more
Near-cutoff sampling for multiple-attribute establishment surveys can be very useful. When the same data items are in an occasional census as those in more frequent samples, testing and accurate production of Official Statistics is... more
Various uses of cutoff sampling are shown. (Near-cutoff sampling with ratio model prediction can work very well for Official Statistics.) - Poster for a speed session for the International Conference on Establishment Statistics ICES... more
"Knaub, J.R., Jr. (2017), 'Comparison of Model-Based to Design-Based Ratio Estimators,' JSM Proceedings, Survey Research Methods Section, American Statistical Association, was on this topic and focused on fundamentals, whereas here we... more
Les formations végétales en Algérie sont façonnées par une combinaison de facteurs climatiques, édaphiques et anthropozoïques, qui influencent continuellement leur physionomie. Cette flore se développe sous un climat typiquement... more
In applying the jackknife procedure to stratified samples, jackknifed variances are often estimated based on overall cross-strata pseudo means. The stratum weights employed in estimating overall cross-strata pseudo means may deviate from... more
Jiahe Qian This paper discusses minimum mean square error (MSE) estimation to a population in the context of stratified simple random sampling. We have derived adjusted stratum weights that minimize the MSE of the weighted sample mean.... more
El Análisis de Redes es un enfoque de estudio de reciente creación y popularización, al grado que cada día son más las disciplinas científicas sociales que lo aplican a su materia de estudio. Su perspectiva especial consiste en el uso... more
The problem mentioned here is to obtain unbiased estimate of the proportion of people having sensitive characteristics like accumulated savings, intentional tax evasion, drunken driving etc. in a society. Warner presented a technique... more
The Franciscana dolphin ( Pontoporia blainvillei), a small cetacean endemic to southwestern Atlantic coastal waters, is the most endangered marine mammal species in the south Atlantic. In the Espírito Santo State, in southeastern Brazil,... more
A wide variety of local weight and land measurement units in Bangladesh. These are considered a major problem in registering land records and in pricing and marketing agricultural commodities. This study reports the findings of a... more
The central limit theorem is central to modern statistics, particularly for sampling theory. Despite what you may have heard, it hasn’t been proven mathematically in relation to samples, unless they approach infinity in size, which... more
In this paper, we study, within a modeling framework, the joint treatment of nonignorable dropout and informative sampling for longitudinal survey data, by specifying the probability distribution of the observed measurements when the... more
Objectives: Cycle 5 of the National Survey of Family Growth (NSFG) was conducted by the National Center for Health Statistics (NCHS) in 1995. The NSFG collects data on pregnancy, childbearing, and women's health from a national sample of... more
Boxplots showing distribution of all antecedent dry periods and the antecedent dry periods for runoff events during which runoff samples were collected at U.S. Geological Survey monitoring stations on State Route 2A in Boston... more
The normalized odd Collatz map is analyzed as a discrete dynamical system in the ring of 2-adic integers. It is proven that the existence of periodic macro-cycles is equivalent to the solvability of a Diophantine system governed by an... more
Social surveys generally assume that a sample of units (students, individuals, employees, . . . ) is observed by two-stage selection from a finite population, which is grouped into clusters (schools, household, companies, . . . ). This... more
The calibration method has been widely discussed in the recent literature on survey sampling, and calibration estimators are routinely computed by many survey organizations. The calibration technique was introduced in [12] to estimate... more
The convenience of online surveys has quickly increased their popularity for data collection. However, this method is often non-probabilistic as they usually rely on selfselection procedures and internet coverage. These problems produce... more
This study introduces a general framework on inference for a general parameter using nonprobability survey data when a probability sample with auxiliary variables, common to both samples, is available. The proposed framework covers... more
ipfweight performs a stepwise adjustment (known as iterative proportional fitting or raking) of survey sampling weights to achieve known population margins. The iterative process is repeated until the difference between the sample margins... more
The controversy surrounding the Mandatory Country-of-Origin Labeling (COOL) has attracted research attentions. A number of studies have reported consumers are willing to pay more for beef labeled with U.S. origin versus beef from unknown... more
Applications of several adjustment methods and the Without Replacement Bootstrap (BWO) are presented, using data from the 1997 Annual Business Survey, conducted by Portugal's National Statistics Institute. The application of these methods... more
This article describes the prevalence of multi-morbidity and its association with socioeconomic and demographic factors using National Sample Survey 2017-18 data, on 42756 older adults aged 60+. The prevalence of multimorbidity is... more
• Meaning and Importance of Population and Sample • Meaning and Principles of Sampling • Probability Sampling designs – Simple Random, Stratified Random, Clustered Sampling • Non-probability Sampling Designs - Purposive, Convenient and... more
Ranked set sampling (RSS) is an important survey technique aimed at efficient estimation of population characteristics. Various RSS methods can be used to collect an RSS sample, but the reproducibility of estimates using these methods... more
This paper investigates the impact of minimum wages on wages and employment in Greece between 2009 and 2017. Our main contribution is the examination of the effects of minimum wages under a dramatically changing context, as during this... more
The paper considers a Finnish human survey that takes advantage of explicit stratification. Stratification is not ordinary, since there are two types of stratification. In one, the strata are municipality based that are much used earlier.... more
This memorandum responds to a request from the House Subcommittee on Census and Population for information concering the planning for the 1990 Census. It identifies eleven major issues concerning the implementation of the 1990 Census: (1)... more
Breve sintesi storica, dalle due lettere che Sagredo scrive a Galileo nel 1612 e nel 1613 e che documentano i primordi nella misura della temperatura, alla scala assoluta di misura della temperatura introdotta da Kelvin.
Local species richness and between-site similarity in species composition of parasitoid wasps (Hymenoptera: Ichneumonidae; Pimplinae and Rhyssinae) were correlated with those of four plant groups (pteridophytes, Melastomataceae,... more
This paper presents the first record of Naucrates ductor ( Linnaeus 1758) from Syrian waters. One specimen (300 mm TL, 294.29 g TW) was caught by purse-seine nets at about 60 m depth from Lattakia coast, on 25 September 2020. This record... more
Respondent driven sampling (RDS) is a relatively new network sampling technique typically employed for hard-to-reach populations. Like snowball sampling, initial respondents or "seeds" recruit additional respondents from their... more
This Paper aims to test the causal relationship between the Volatility Index (VIX) and global gold prices during the period from 4 January 2021 to 28 August 2025, using Granger causality testing within a vector autoregressive (VAR) model.... more
conjunction with a nationwide reorganization of access to research data, such that five other service and research centres were set up / established almost at the same time: at the Federal Statistical Office (in Wiesbaden and Bonn), the
Using a set of random telephone and Internet (web− based) survey samples for a national advisory referendum, we implement Beta models to handle proportional budget information, and allow for consistency in modeling assumptions and the... more
0 1 1 ( ) ( ) ( ) 0 e e e     . in both the cases I and II. The other expected values ignoring finite population correction (fpc) terms in case I and case II are given by, Case I: Case II: 2 1 1 ( ) x e C n   , 2 0 1 ( ) x e e CC... more
This paper proposes two ratio and product-type estimators using transformation based on known minimum and maximum values of auxiliary variable. The biases and mean squared errors of the suggested estimators are obtained under large sample... more
This paper is an attempt to develop an estimator for finite population mean. Motivated by , a ratio in ratio type exponential strategy is developed for estimation of population mean in double sampling for stratification. To compare with... more
This paper addresses the problem of estimating the finite population mean of the study variable y in the presence of auxiliary attributes. An improved class of estimators for population mean has been defined along with properties under... more
Use of auxiliary information has been in practice to improve the efficiency of the estimators of parameters. Ratio, product and regression methods are good examples of use of auxiliary information. Ratio, product and regression type... more
This paper is an attempt to develop an estimator for finite population mean. Motivated by Kiregyera (1984), a ratio in ratio type exponential strategy is developed for estimation of population mean in double sampling for stratification.... more
A ratio estimator is proposed for the ratio of two population means using auxiliary information in stratified random sampling. Bias and mean squared error expressions are obtained under large sample approximation, and the proposed... more
proposed a general family of estimators for population mean using known value of some population parameters in simple random sampling. The objective of this paper is to propose a family of combined-type estimators in stratified random... more
Download research papers for free!