Research Article

A Neoteric Data Preprocessing Technique for Online Surveys

by Akshay R, Arti Arya
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 163 - Number 11
Year of Publication: 2017
Authors: Akshay R, Arti Arya
DOI: 10.5120/ijca2017913761

Akshay R, Arti Arya. A Neoteric Data Preprocessing Technique for Online Surveys. International Journal of Computer Applications. 163, 11 (Apr 2017), 9-12. DOI=10.5120/ijca2017913761

@article{ 10.5120/ijca2017913761,
author = { Akshay R, Arti Arya },
title = { A Neoteric Data Preprocessing Technique for Online Surveys },
journal = { International Journal of Computer Applications },
issue_date = { Apr 2017 },
volume = { 163 },
number = { 11 },
month = { Apr },
year = { 2017 },
issn = { 0975-8887 },
pages = { 9-12 },
numpages = {4},
url = { https://ijcaonline.org/archives/volume163/number11/27437-2017913761/ },
doi = { 10.5120/ijca2017913761 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:09:54.209898+05:30
%A Akshay R
%A Arti Arya
%T A Neoteric Data Preprocessing Technique for Online Surveys
%J International Journal of Computer Applications
%@ 0975-8887
%V 163
%N 11
%P 9-12
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Online surveys are an essential research tool applied in a variety of research fields, including marketing, social, and official statistics research, and are therefore one of the most popular data collection techniques. Some respondents fill them in genuinely, while others answer at random. Samples that are not filled in genuinely may considerably distort the analysis of the collected data. This paper proposes a preprocessing technique that selects the samples containing genuine responses, so that the final data collected from the survey is more precise and accurate. For this purpose, the time a respondent takes to answer each question in the questionnaire is captured. The proposed algorithm computes a time range for each question, and the percentage of questions whose captured times fall within these ranges indicates whether the sample was filled in genuinely. Samples found to be genuine can then be given more weight when the survey is analyzed, or randomly filled samples can be eliminated.
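
The check described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: the abstract does not say how each question's time range is computed, so the ranges below are supplied by hand, and the names `in_range_fraction` and `is_genuine`, the sample timings, and the 0.7 threshold are all hypothetical.

```python
from typing import Sequence, Tuple

# Hypothetical representation of a per-question time range: (min_seconds, max_seconds).
TimeRange = Tuple[float, float]


def in_range_fraction(times: Sequence[float], ranges: Sequence[TimeRange]) -> float:
    """Return the fraction of questions whose response time lies inside that question's range."""
    if len(times) != len(ranges):
        raise ValueError("need exactly one time range per question")
    if not times:
        return 0.0
    hits = sum(1 for t, (lo, hi) in zip(times, ranges) if lo <= t <= hi)
    return hits / len(times)


def is_genuine(times: Sequence[float], ranges: Sequence[TimeRange],
               threshold: float = 0.7) -> bool:
    """Flag a sample as genuine when enough questions fall inside their ranges.

    The threshold value is an assumption made for this sketch.
    """
    return in_range_fraction(times, ranges) >= threshold


# Assumed time ranges (seconds) for a four-question survey.
ranges = [(3.0, 30.0), (5.0, 45.0), (2.0, 20.0), (4.0, 40.0)]

careful = [8.2, 12.5, 6.0, 15.1]  # every answer within its range
rushed = [0.6, 0.9, 2.5, 1.1]     # only the third answer within its range

print(in_range_fraction(careful, ranges))  # 1.0
print(in_range_fraction(rushed, ranges))   # 0.25
```

Instead of the hard cut-off in `is_genuine`, the fraction itself could serve as a per-sample weight during analysis, which corresponds to the abstract's alternative of weighting genuine samples more heavily rather than eliminating the others.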

Index Terms

Computer Science
Information Sciences

Keywords

Surveys
Questionnaire