Predictive Maintenance in IoT Using NLP Techniques
annif.suggestions | machine learning|sensors|data|upkeep (servicing)|Internet of things|maintenance|measurement|data mining|reliability (general)|neural networks (information technology)|en | en |
annif.suggestions.links | http://www.yso.fi/onto/yso/p21846|http://www.yso.fi/onto/yso/p11460|http://www.yso.fi/onto/yso/p27250|http://www.yso.fi/onto/yso/p9046|http://www.yso.fi/onto/yso/p27206|http://www.yso.fi/onto/yso/p9047|http://www.yso.fi/onto/yso/p4794|http://www.yso.fi/onto/yso/p5520|http://www.yso.fi/onto/yso/p1629|http://www.yso.fi/onto/yso/p7292 | en |
dc.contributor.author | Hassan, Syed | |
dc.contributor.faculty | fi=Tekniikan ja innovaatiojohtamisen yksikkö|en=School of Technology and Innovations| | - |
dc.contributor.organization | fi=Vaasan yliopisto|en=University of Vaasa| | |
dc.date.accessioned | 2025-05-28T10:55:18Z | |
dc.date.accessioned | 2025-06-25T18:02:55Z | |
dc.date.available | 2025-05-28T10:55:18Z | |
dc.date.issued | 2025-05-16 | |
dc.description.abstract | In this thesis we study how NLP techniques can be integrated with Internet of Things (IoT) sensor data, to be used for predictive maintenance in industrial environments. The main objective was to build models that can predict whether an equipment will fail in the next 7 days using the iot_predictive_maintenance dataset. Sensor readings include temperature, vibration, humidity, and pressure etc. along with unstructured textual maintenance logs. Two sets of the feature were built: one containing only numeric sensor data and the other with features from TextBlob, namely TF- IDF vectors and polarity scores on sentiment, to create a feature set. Logistic Regression, Random Forest, Gradient Boosting, and XGBoost models were evaluated multiple times. It was found that models incorporating both sensor and NLP features significantly outperformed those based solely on sensor data. Among the evaluated models, the best performance was achieved by the XGBoost model using the combined feature set, which attained an accuracy of 0.973, an F1 score of 0.889, and a ROC AUC of 0.993. These results confirm that the textual in-formation in maintenance logs contains important failure aspects not expressed from numerical data alone. This work demonstrates practicality of fusion models in predictive maintenance, enabling a scalable and robust solution for smart manufacturing. | - |
dc.format.bitstream | true | |
dc.format.extent | 82 | - |
dc.identifier.olddbid | 23649 | |
dc.identifier.oldhandle | 10024/19449 | |
dc.identifier.uri | https://osuva.uwasa.fi/handle/11111/12483 | |
dc.identifier.urn | URN:NBN:fi-fe2025051646071 | - |
dc.language.iso | eng | - |
dc.rights | CC BY 4.0 | - |
dc.source.identifier | https://osuva.uwasa.fi/handle/10024/19449 | |
dc.subject.degreeprogramme | Master's Programme in Sustainable and Autonomus Systems (SAS) | - |
dc.subject.discipline | fi=Tietoliikennetekniikka|en=Telecommunications Engineering| | - |
dc.subject.yso | machine learning | - |
dc.subject.yso | Internet of things | - |
dc.subject.yso | natural language processing | - |
dc.title | Predictive Maintenance in IoT Using NLP Techniques | - |
dc.type.ontasot | fi=Diplomityö|en=Master's thesis (M.Sc. (Tech.))|sv=Diplomarbete| | - |
Tiedostot
1 - 1 / 1
Ladataan...
- Name:
- Uwasa_2025_Mahmood_ul_Hassan_Syed.pdf
- Size:
- 2.26 MB
- Format:
- Adobe Portable Document Format
- Description:
- This thesis investigates the integration of Natural Language Processing (NLP) techniques with IoT sensor data to enhance predictive maintenance in industrial settings. The research utilizes sensor data (temperature, vibration, humidity, pressure) combined with textual maintenance logs to predict equipment failures within a 7-day time frame. Machine learning models, including Logistic Regression, Random Forest, Gradient Boosting, and XGBoost, were evaluated using sensor-only data and NLP-augmented datasets. Results demonstrate that models enriched with NLP features significantly outperform traditional sensor-only models, with the NLP-enhanced XGBoost model achieving an accuracy of 97.3%, an F1 score of 0.889, and ROC-AUC of 0.993. The study highlights the practical benefits of combining structured and unstructured data, paving the way for more robust and context-aware predictive maintenance systems in smart manufacturing.