Peningkatan Kualitas Data dalam Konsolidasi Data Karyawan melalui Data Wrangling

Authors

  • Joni Joni BINUS UNIVERSITY
  • Teguh Prasandy BINUS UNIVERSITY

DOI:

https://doi.org/10.33633/joins.v9i2.9423

Keywords:

Data wrangling, Kualitas data, Persiapan data, Data pegawai

Abstract

Data wrangling is a critical process in data preparation that can significantly improve data quality. Effective data wrangling techniques, which consists of 6 steps i.e. Discovery, Structuring, Cleaning, Enriching, Validating, Publishing, can help Corporate Human Resource Division to ensure that their data is of high quality and ready for analysis. In this case study, we explore how effective data wrangling techniques can be used to improve data quality in employee data consolidation. We found employee data downloaded from various sources, captured incomplete, unreliable, or incorrect so that it could affect data analysis. Data wrangling seeks to remove that risk by ensuring data is in a reliable state before it’s analyzed and leveraged. We analyze a dataset from multiple sources of employee data and demonstrate how data wrangling techniques can be used to clean and transform the data to improve data quality and ready for analysis. Our study provides empirical evidence of the impact of data wrangling on data quality and highlights the importance of this process in employee data consolidation and provide workforce analytics.

Author Biographies

Joni Joni, BINUS UNIVERSITY

Information System

Teguh Prasandy, BINUS UNIVERSITY

Information System

References

Andre, L. (2021). 53 Important Statistics about How Much Data is Created Every Day. FinancesOnline.

Azeroual, O. (2020). Data Wrangling in Database Systems: Purging of Dirty Data. German Center for Higher Education Research and Science Studies (DZHW).

Endel, F., Piringer, H. (2015). Data Wrangling: Making data useful again. Elsevier, IFAC-PapersOnLine, Volume 48, Issue 1, 2015, Pages 111-112.

Furche, T., Gottlob, G., Libkin, L., Orsi, G., Paton, N. (2016). Data Wrangling for Big Data: Challenges and Opportunities. Published in Proc. 19th International Conference on Extending Database Technology (EDBT), March 15-18, 2016 - Bordeaux, France: ISBN 978-3-89318-070-7.

Mazilu, L., Paton, N., Konstantinou, N., Fernandes, A. (2020). Fairness in Data Wrangling. In 21st International Conference on Information Reuse and Integration for Data Science, IRI 2020, Las Vegas, NV, USA.

Patil, M., Hiremath, B. (2018). A Systematic Study of Data Wrangling. I.J. Information Technology and Computer Science, 2018, 1, 32-39 Published Online January 2018 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2018.01.04.

Siegler, M. (2010). Eric Schmidt: Every 2 Days We Create as Much Information as We Did Up To 2003.

Stobierski, T. (2021). Data Wrangling: What it is & why it’s important. Harvard Business School Online.

Taylor, P. (2022). Volume of data/information created, captured, copied, and consumed worldwide from 2010 to 2020, with forecasts from 2021 to 2025. Statista.

Terrizzano, I., Schwarz, P., Roth, M., Colino, J. (2015). Data wrangling: The challenging journey from the wild to the lake. CIDR Conference paper 2015.

Thurber, M. (2018). What is Data Wrangling and Why Does it Take So Long? Elder Research.

Todd, S. (2020). Data wrangling vs. Data Cleaning: What’s the difference? Inzata Analytics.

Vodovatova, E. (2019). What is Data Wrangling (Data Munging)? Steps, Solutions, and Tools. Theappsolutions.

Wolff, R. (2021). What Is Data Wrangling & Why Is It Necessary? Monkeylearn. https://monkeylearn.com

Yasar, K., Essex, D., Rebernik, D. (2022). SAP SuccessFactors. TechTarget.

Downloads

Published

2025-02-18

How to Cite

[1]
J. Joni and T. Prasandy, “Peningkatan Kualitas Data dalam Konsolidasi Data Karyawan melalui Data Wrangling”, Journal of Information System, vol. 9, no. 2, pp. 135–146, Feb. 2025.

Most read articles by the same author(s)

Obs.: This plugin requires at least one statistics/report plugin to be enabled. If your statistics plugins provide more than one metric then please also select a main metric on the admin's site settings page and/or on the journal manager's settings pages.