Peningkatan Kualitas Data dalam Konsolidasi Data Karyawan melalui Data Wrangling
DOI:
https://doi.org/10.33633/joins.v9i2.9423Keywords:
Data wrangling, Kualitas data, Persiapan data, Data pegawaiAbstract
Data wrangling is a critical process in data preparation that can significantly improve data quality. Effective data wrangling techniques, which consists of 6 steps i.e. Discovery, Structuring, Cleaning, Enriching, Validating, Publishing, can help Corporate Human Resource Division to ensure that their data is of high quality and ready for analysis. In this case study, we explore how effective data wrangling techniques can be used to improve data quality in employee data consolidation. We found employee data downloaded from various sources, captured incomplete, unreliable, or incorrect so that it could affect data analysis. Data wrangling seeks to remove that risk by ensuring data is in a reliable state before it’s analyzed and leveraged. We analyze a dataset from multiple sources of employee data and demonstrate how data wrangling techniques can be used to clean and transform the data to improve data quality and ready for analysis. Our study provides empirical evidence of the impact of data wrangling on data quality and highlights the importance of this process in employee data consolidation and provide workforce analytics.References
Andre, L. (2021). 53 Important Statistics about How Much Data is Created Every Day. FinancesOnline.
Azeroual, O. (2020). Data Wrangling in Database Systems: Purging of Dirty Data. German Center for Higher Education Research and Science Studies (DZHW).
Endel, F., Piringer, H. (2015). Data Wrangling: Making data useful again. Elsevier, IFAC-PapersOnLine, Volume 48, Issue 1, 2015, Pages 111-112.
Furche, T., Gottlob, G., Libkin, L., Orsi, G., Paton, N. (2016). Data Wrangling for Big Data: Challenges and Opportunities. Published in Proc. 19th International Conference on Extending Database Technology (EDBT), March 15-18, 2016 - Bordeaux, France: ISBN 978-3-89318-070-7.
Mazilu, L., Paton, N., Konstantinou, N., Fernandes, A. (2020). Fairness in Data Wrangling. In 21st International Conference on Information Reuse and Integration for Data Science, IRI 2020, Las Vegas, NV, USA.
Patil, M., Hiremath, B. (2018). A Systematic Study of Data Wrangling. I.J. Information Technology and Computer Science, 2018, 1, 32-39 Published Online January 2018 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijitcs.2018.01.04.
Siegler, M. (2010). Eric Schmidt: Every 2 Days We Create as Much Information as We Did Up To 2003.
Stobierski, T. (2021). Data Wrangling: What it is & why it’s important. Harvard Business School Online.
Taylor, P. (2022). Volume of data/information created, captured, copied, and consumed worldwide from 2010 to 2020, with forecasts from 2021 to 2025. Statista.
Terrizzano, I., Schwarz, P., Roth, M., Colino, J. (2015). Data wrangling: The challenging journey from the wild to the lake. CIDR Conference paper 2015.
Thurber, M. (2018). What is Data Wrangling and Why Does it Take So Long? Elder Research.
Todd, S. (2020). Data wrangling vs. Data Cleaning: What’s the difference? Inzata Analytics.
Vodovatova, E. (2019). What is Data Wrangling (Data Munging)? Steps, Solutions, and Tools. Theappsolutions.
Wolff, R. (2021). What Is Data Wrangling & Why Is It Necessary? Monkeylearn. https://monkeylearn.com
Yasar, K., Essex, D., Rebernik, D. (2022). SAP SuccessFactors. TechTarget.
Downloads
Published
How to Cite
Issue
Section
License
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
This work is licensed under a Creative Commons Attribution 4.0 International License.