Abstract
Outliers with the rest of the data points which are different from or inconsistent. New novel, unusual, abnormal or may contain noise. Outliers are sometimes more interesting than the majority of the data. Increasing complexity, size and variety of datasets with major challenges outlier detection, a group, and how to evaluate similar as outliers outliers are to catch. This paper is an approach to detect outlier as a pre-processing step that uses semi surveillance describes outlier detection and then the fuzzy c-means clustering and genetic algorithm to cluster analysis dataset applies to analyze the effects of outliers. As data is digitized, connected and integrated systems, getting the scope of data and analyzes has been growing rapidly. Today, the system's most massive, the size, volume, speed of the phenomenon is changing rapidly, and the non-stationary data generated by these types of data are called data streams. Stream data and your issues in detail in this paper we explore the d