Professional Documents
Culture Documents
I. INTRODUCTION
Air pollution is the biological matter or other harmful
materials in the Earths atmosphere. Air pollutants can have
adverse effects on human lives. The pollutants can have
harmful effects not only on human but also on all kind of
living beings. The substance can be solid particles, liquid
droplets or gaseous substances. Pollutants can be of natural
origin or artificially industrial man made pollutants. They are
classified as primary or secondary types of pollutants.
The primary kinds are those which are produced from
processes of nature, for example, ashes from volcanic
eruptions, CO from vehicles gas emulsion. The secondary are
the kinds where the pollutants are spread indirectly in the air
when primary pollutants react with ground level ozone.
In this paper the primary emulsion of pollutants are
calculated. NOx is spread from thunderstorm and industrial
works. SO2 is spread from the volcanic eruptions, industrial
works petroleum combustion which can be caused from
various sources mostly from vehicles. Further oxidization in
the atmosphere let it remain as a catalyst which later turn into
acid rain in atmosphere. CO is created from combustion
through vehicles, wood burning, natural gases and coal etc. O3
is caused by the disturbance of almost the same process of
materials and mixtures of chemicals and oxidization of them.
PM10 and PM 2.5 are particulate matters containing lead and
other harmful matter in solid matters and liquid droplets in the
atmosphere. The types are in between 2.5 micrometers to 10
micrometers. The coarse particulates are PM10 and the finer
particulates are PM 2.5. PM 10 is smoke, dust and dirt from
roadside and factories. The PM10 is created from the process
that rocks and soil go through that includes mixtures of rocks
and metals turned into smaller state and mixed. The finer
particulate matter contains toxic organic compound and heavy
metals. They are created from driving automobiles, burning
plants and smelting or purifying process from metals. Human
body responds to particular invasion more than any other
pollutants in the atmosphere. PM2.5 is more harmful to the
living being than PM10.
Particle matters trigger diseases like asthma and play an
important part in respiratory diseases and as well in dangerous
cancer diseases causing premature deaths. It is stated that
some air pollutants increase in dry season compared to rainy
seasons. Most of the lung diseases are the result of air
pollutants present in our atmosphere.
It is the major problem faced in day to day lives. It is
getting more serious with the development of the growing
cities and their increasing population. The increase in the
number of vehicles, increase in industrial expansion, etc. are a
major turnabout for pollution. Bangladesh is also facing this
problem due to continuous increase of population. As it is a
developing country with its increasing number of vehicles, air
pollution is now affecting the health of the people.
The conversion of vehicles, establishment of factories,
illegal disposing of chemical products and usage of ozone
harming products are main causes of pollution. Different
levels of air pollutants present in air also causes health
problems. Such pollutants are Carbon Monoxide (CO),
Nitrogen Oxide (NOx), Sulfur Oxide (SOx), PM10 also Pm2.5.
These pollutants cause some serious health problems and
diseases which can be fatal in result. These pollutants not only
affect the healthy adults but also the children. These diseases
affect a certain age of people.
In this research the presence of air pollutant, their
percentage and the number admitted patient in the hospital are
investigated using the decision tree algorithm to learn the
effect of the emission of vehicles fuel on air and also the
effect of air pollutant on peoples health. This study is focused
on Dhaka city. As Bangladesh is a developing country so the
technologies and software here are not quite updated for data
mining task, thus data processing could be hard.
Medical data are collected from well-known NIDCH
(National Institute of Disease of the Chest and Hospital) and
the air pollution data are collected from DoE (Department of
Environment). The main goal of this study is to find the level
of air pollutants in the atmosphere and also the effect of
emission of vehicles fuel on air so that necessary step could
be taken to prevent the health risks due to their exposure.
Mean =
Mean=
.
.
Jun
Jul
6.61
Dec
7.76
=7.42
4.53
Aug
Average = (+ )/2=5.57
Checking if < and < .
Add x to or subtract x from using ones judgment in
order to see which value fits best for the missing value . So
the calculated missing value is is6.27.
IV. ALGORITHMS
Two algorithms were used in the overall project. First a kmeans clustering algorithm was used in order to cluster the
records of air pollution levels and also the numbers of patient
admissions to the NIDCH. Then the CART analysis was
performed on both the data sets in order to create a decision
tree model which results in proper classification of the data.
Our approaches to implement the algorithms are explained
further below.
4.1. Clustering using k-Means algorithm:
Given the nature of our data set already discussed earlier the
k-means algorithm suited best to work with. The algorithms
tendency to locate clusters of comparable spatial extent and
different shapes is useful as our data of air pollution levels
contains attributes with very high values compared to values
of other attributes within the set. The application of the
algorithm in feature learning also comes in handy in
recognizing the trends of the increase and decrease of air
pollution levels. For all the clusters the divergence of the data
was a measure of the squared Euclidean distance among the
objects.
The traditional Bengali calendar consists of six seasons.
However, for our project, we specified the seasons of
[2]
[3]
[4]
[5]
[6]
COPD
Age Group: 50+
Tree 1a (I.G)
Tree 1b (G.I)
Tree 1c (I.G)
Tree 1d (G.I)
ILD
Age Group: 50+
Tree 2d (G.R)
Tree 2b (G.I)
Tree 2c (I.G)
Bronchial Carcinoma
Age Group: 50+
Age Group: 24-49
Tree 2a (I.G)
Tree 3a (G.I)
Tree 3b (G.R)
Tree 3c (I.G)
Tree 3d (G.R)
Figure 4: Decision trees generated for the different age groups