Affordable Access

Access to the full text

Diagnostic model of low visibility events based on C4.5 algorithm

Authors
  • Li, Chao1, 2
  • Shi, Yimin3
  • Gao, Ping1, 4
  • Shen, Yang1, 2
  • Ma, Chenchen3, 2
  • Shi, Dawei3, 2
  • 1 Jiangsu Meteorological Station, 210008 , (China)
  • 2 Key Laboratory of Transportation Meteorology, China Meteorological Administration, 210009 , (China)
  • 3 Lianyungang Meteorological Bureau, 222006 , (China)
  • 4 Jiangsu Meteorological Public Service Center, 210008 , (China)
Type
Published Article
Journal
Open Physics
Publisher
Versita
Publication Date
Mar 25, 2020
Volume
18
Issue
1
Pages
33–39
Identifiers
DOI: 10.1515/phys-2020-0007
Source
De Gruyter
Keywords
License
Green

Abstract

In this study the low visibility in Nanjing city is classified and predicted using observed data during 2014 to 2016 with machine-learning based decision tree algorithm (4.5). For this purpose, the model was trained with 3/4th of the data samples until the self-learning accuracy of the model reached 88.32%. The remaining 1/4th of the data samples were used to verify the model’s prediction ability, with the test accuracy reaching 88.34% indicating a good classification diagnosis effect of the model. The results produced with model, generated through learning from the training sample, it is found that the relative humidity, PM10 and PM2.5 are important factors in diagnosing “whether low visibility events will occur in Nanjing”: When relative humidity is favorable (i.e. <90%) and PM2.5 concentration is not high enough (i.e. <146), the probability of low visibility events may reduce; when relative humidity is relatively favorable (i.e. ≥ 90%) with a PM10 concentration ≥ 59, low visibility events are more likely to occur; when relative humidity is extremely favorable (i.e. ≥ 96%) with a low PM10 concentration (i.e. <59), there is also a high probability that low visibility events will occur.

Report this publication

Statistics

Seen <100 times