Summary
Exploring the full potential of BD and its related technologies requires fundamental comprehension of basic data concepts. Before starting any data analytics process, it is essential to understand the multiple data sources available and interdependencies between them. The various types, formats, and magnitude of data are essential factors in the composition of the business context and, consequently, the objective of the analytics program. Additionally, they guide the selection of BD technologies and methodologies to meet the challenges of data acquisition, storage, and processing. A pervasive topic that frequently undermines the success of data analytics programs is Data Quality. Special attention must be dedicated to understand causes and mitigate the effects of weak data. An effective strategy can be created by embedding Data Quality programs into Data Governance programs.
References
- 1. Ozdemir, S., S. Kakade, and Tibaldeschi, Principles of Data Science, 2nd edition. 2018: Packt Publishing, Birmingham.
- 2. Hurwitz, J., A. Nugent, H. Fern, and M. Kaufman, Unstructured Data in a Big Data Environment, [cited 2020 24/03/2020); available from: https://www. dummies.com/programming/big-data/engineering/unstructured-data- in-a-big-data-environment/.
- 3. Marr, B., What's the Difference between Structured, Semi-Structured and Unstructured Data? 2019 [cited 2020 20/03/2020]; available from: https://www. forbes.com/sites/bernardmarr/2019/10/18/whats-the-difference-between- structured-semi-structured-and-unstructured-data/#362053832b4d.
- 4. Firmani, D., M. Mecella, M. Scannapieco, and C. Batini, On the Meaningfulness of ".Big Data Quality". 2015: Springer, Berlin.
- 5. Mohamed, I., A. Rush, J. Yusmadi, and R. Nor, A Review on the performance measurement of big data analytics process. 2018: Faculty of Computer Science and Information Technology, University Putra Malaysia.
- 6. Cheng, Y., C. Ken, S. Hemeng, Z. Yongping, and T. Fei, Data and knowledge mining with big data towards smart production. Journal of Industrial Information Integration, 2017. 9: pp. 1-13.
- 7. Dumas, M., M. La Rosa, J. Mendling, and H. Reijers, Fundamentals of Business Process Management, 2nd edition. 2018: Springer, Berlin.
- 8. Gessert, F., W. Wingerath, S. Friedrich, and N. Ritter. NoSQL database systems: A survey and decision guidance. Computer Science - Research and Development, 2016. 32: 353-365.
- 9. Nazrul, S., CAP Theorem and Distributed Database Management Systems. 2018 [cited 2020 20/03/2020); available from: https://towardsdatascience.com/ cap-theorem-and-distributed-database-management-systems-5c2be977950e
- 10. Gilbert, S. and N. Lynch, Perspectives on the CAP theorem. Computer. 2012. 45(2): pp. 30-36.
- 11. Redman, T, Data's Credibility Problem. 2013 [cited 2020 24/03/2020); available from: https://hbr.org/2013/12/datas-credibility-problem.
- 12. Deloitte. Measuring Data Quality, [cited 2020 24/03/2020]; available from: https:// www2.deloitte.com/ie/en/pages/tax/articles/measuring-data-quality.html.
- 13. Nagle, T., T. Redman, and D. Sammon, Only 3% of Companies' Data Meets Basic Quality Standards. 2017 [cited 2020 24/03/2020]; available from: https://hbr. org/2017/09/only-3-of-companies-data-meets-basic-quality-standards.
- 14. Redman, T., Bad Data Costs the U.S. $3 Trillion Per Year. 2016 [cited 2020 24/03/2020); available from: https://hbr.org/2016/09/bad-data-costs-the-u-s-3- trillionper-year.
- 15. Magoulas, R. and S. Swoyer, The State of Data Quality in 2020. 2020 [cited 2020 24/03/2020]; available from: https://www.oreilly.com/radar/the-state- of-data-quality-in-2020/.
- 16. Taleb, I., El Kassabi, H. T, Serhani, M. A., Dssouli, R., & Bouhaddioui, C. (2016, July). Big data quality: A quality dimensions evaluation. In 2016 Inti IEEE Conferences on Ubicjuitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress (UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld) (pp. 759-765). IEEE.
- 17. Strong, D., Y. Lee, and R. Wang, Data quality in context. Communications of the ACM, 1997. 40: p. 103.
- 18. Cai, L. and Y. Zhu, The challenges of data quality and data quality assessment in the big data era. Data Science Journal, 2015.14(2): pp. 1-10.
- 19. Gudivada, V. and J. Ding, Data quality considerations for big data and machine learning: Going beyond data cleaning and transformations. International Journal on Advances in Software, 2017.10(1): pp. 1-20.
- 20. Henderson, D., et al., DAMA-DMBOK: Data Management Body of Knowledge, second edition, 2017: Technics Publications, Denville; NJ.
- 21. Hu, H., Y. Wen, T. Chua, and X. Li, Toward scalable systems for big data analytics: A technology tutorial. IEEE Access, 2014. 2: pp. 652-687.
- 22. Lyko, К., M. Nitzschke, and A. Ngomo, Big data acquisition. In Cavanillas, J., Curry, E., Wahlster, W. (eds.) New Horizons for a Data-Driven Economy: A Roadmap for Usage and Exploitation of Big Data in Europe. 2016: Springer, Cham.
- 23. Strohbach, M., J. Daubert, H. Ravkin, and M. Lischka, Big data storage. In Cavanillas, J., Curry, E., Wahlster, W. (eds.) New Horizons for a Data-Driven Economy: A Roadmap for Usage and Exploitation of Big Data in Europe. 2016: Springer, Cham.
- 24. Irfan, S. and B. Babu, Information retrieval in big data using evolutionary computation: A survey. International Conference on Computing, Communication and Automation (ICCCA), Arizona, USA2016.
- 25. Malavuru, D., R. Shriman, and S. Vijayan, Big Data Analytics in Information Retrieval: Promise and Potential, 2014.
- 26. Haneef, I., M. Ehsan, Q. Ghazia, and U. Hafiz, Big data retrieval: Taxonomy, techniques and feature analysis. International Journal of Computer Science and Network Security, 2018.18: pp. 55-59.
- 27. Abualigah, L. and H. Essam, Apply genetic algorithms to information retrieval using vector space model. International Journal of Computer Science, Engineering and Applications, 2015. 5(1): pp. 19-28.
- 28. Prasanth, T. and M. Gunasekaran, Effective big data retrieval using deep learning modified neural networks. Mobile Networks and Applications, 2019. 24: pp. 282-294.
- 29. Edwards, M., Industry Q&A: Mining Data Management, Analysis and Integrity. 2015 [cited 2020 24/03/2020]; available from: https://www.miningglobal.com/ technology/industry-qa-mining-data-management-analysis-and-integrity.
- 30. Wipro. Application of Big Data Solution to Mining Analytics, [cited 2020 24/03/2020]; available from: https://www.wipro.com/en-BR/natural-resources/ application-of-big-data-solution-to-mining-analytics.
- 31. Geosoft. Geoscience data management report. 2017.
- 32. Seequent. What Challenges Are Mining Companies Facing in Managing Their Geological Data? 2019. [cited 2020 24/03/2020]; available from: https://www. seequent.com/what-challenges-are-mining-companies-facing-in-managing- their-geological-data.
- 33. Song, M., Z. Li, B. Zhou, and C. Li, Cloud computing model for big geological data processing. Applied Mechanics and Materials, 2014. 475-476: pp. 306-311.
- 34. Zhu, Y., Y. Tan, X. Luo, and Z. He, Big data management for cloud-enabled geological information services. Scientific Programming, 2018.2018(7): p. 1.
- 35. Rasekh, A., Hassanzadeh, A., Mulchandani, S., Modi, S., & Banks, M. K. (2016). Smart water networks and cyber security.
- 36. Martins, P., Aplicacao de Cartas de Controle para Analise de Estabilidade de Fluxo Produtivo em um Sistema de Gerenciamento de Frota na Mineracao, 2019.
- 37. Qi, C., Big data management in the mining industry. International Journal of Minerals, Metallurgy and Materials, 2020. 27(2): p. 131.
- 38. Hsu, N., Data quality of fleet management systems in open pit mining: Issues and impacts on key performance indicators for haul trucks fleets. Queen's University, 2015.
- 39. Seery, J., Data management and geotechnical models. In Eighth International Conference on Deep and High Stress Mining, Perth, Australia from 28-30 March, 2017.
- 40. Bergh, L., Artificial intelligence in mineral processing plants: An overview. In International Conference on Artificial Intelligence: Technologies and Applications, January 24-25,2016, in Bangkok, Thailand 2016.
- 41. Bergh, L., J. Yianatos, C. Acuna, H. Perez, and F. Lopez, Supervisory control at Salvador flotation columns. Minerals Engineering, 1999.12: pp. 733-744.
- 42. Houdoin, D., Methods for automatic control, observation, and optimization in mineral processing plants. Journal of Process Control, 2011. 21(2): pp. 211-225.
- 43. Bergh, L. and J. Yianatos, The long way toward multivariate predictive control of flotation processes. Journal of Process Control, 2011. 21: pp. 226-234.
- 44. Oosthuien, D., I. Craig, S.-L. Jamsa-Jounela, and B. Sun, On the current state of flotation modelling for process control. IFAC-PapersOnLine, 2017. 50: pp. 19-24.
- 45. Jovanovic, I., M. Igor, and J. Tomislav, Soft computing-based modelling of flotation processes - A review. Minerals Engineering, 2015. 84: pp. 34-63.
4__