A Study of Challenges and Limitations to Applying Machine Learning to Highly Unstructured Data
Citations Over TimeTop 20% of 2023 papers
Abstract
In recent years, there has been a lot of curiosity about the use of machine learning algorithms to analyze unstructured data, including social media posts and videos. The limitations and difficulties posed by these technologies must be overcome in order to guarantee accurate and trustworthy results, despite the promise they hold. This study examines the restrictions and difficulties associated with using machine learning algorithms on unstructured data. We identify the main difficulties in working with unstructured data, such as the lack of standardization and the difficulty in handling noise and bias, through a literature review and case studies. We then look at methods for overcoming these obstacles, such as feature engineering, pre-processing techniques, and ensemble methods. The results of this study have implications for both theoretical and applied work in machine learning, particularly in the fields of feature engineering, model selection, and data cleaning. This paper's goal is to provide a thorough understanding of the constraints and difficulties associated with using machine learning algorithms on unstructured data, as well as to contribute to the creation of best practices for handling these data type
Related Papers
- → Children’s scientific curiosity: In search of an operational definition of an elusive concept(2012)351 cited
- → On educating, curiosity, and interest development(2020)84 cited
- → Curiously different: Interest-curiosity and deprivation-curiosity may have distinct benefits and drawbacks(2023)23 cited
- → Space for curiosity(2013)38 cited
- → Led by curiosity and responding with voice: The influence of leader displays of curiosity and leader gender on follower reactions of psychological safety and voice(2022)24 cited