Developing an Agricultural Portal for Crop Disease Prediction

Page 1

IJIRST –International Journal for Innovative Research in Science & Technology| Volume 3 | Issue 11 | April 2017 ISSN (online): 2349-6010

Developing an Agricultural Portal for Crop Disease Prediction Samiksha Bhor UG Student K.J. Somaiya Institute of Engineering & Information Technology, Sion, Mumbai, India

Shubha Kotian UG Student K.J. Somaiya Institute of Engineering & Information Technology, Sion, Mumbai, India

Aishwarya Shetty UG Student K.J. Somaiya Institute of Engineering & Information Technology, Sion, Mumbai, India

Prashant Sawant Assistant Professor K.J. Somaiya Institute of Engineering & Information Technology, Sion, Mumbai, India

Abstract Agriculture is the basic occupation of all Indians. Farmer is said to be man of nation. We consider this as our responsibility to explore this occupation and take it to a higher level from technology point of view. The basic purpose for developing this system is crop disease prediction using various data mining techniques. Our project describes a new approach to crop disease prediction which helps to prevent future economical losses. This project emphasizes on every single concept related to crop diseases. This is accomplished by building a web platform in which farmers can interact with expert, share their experiences and knowledge. This results in a dynamically-growing online survey, which ultimately helps in data collection that can be used to identify various crop diseases and helps to prevent them. This portal can be used for multiple purposes where agro based industries can use our data to launch their products as well as acquire feedbacks. Agricultural institutes can explore new patterns in crop diseases and use required technology to prevent them. This system will be helpful for students perceiving agriculture studies; they can collect the correct information from the appropriate source and in precise manner. Keywords: CBIR, Crop diseases, Data Mining, Disease prediction, Image Comparison, Naive Bayes _______________________________________________________________________________________________________ I.

INTRODUCTION

India is a cultivated country and about 70% of the population depends on agriculture. Large range of diversity for selecting various suitable crops and finding the suitable pesticides for plants is found. Disease on plant leads to the significant reduction in both the quality and quantity of overall agricultural products. The studies of plant disease refer to the visually observable patterns on the plants. Monitoring of diseases on plant plays an important role in successful crop cultivation. In early days, the monitoring and analysis of plant diseases were done manually by the expertise person in that field. This process requires tremendous amount of work hence requires excessive processing time. The plant disease detection can be done effectively with the help of various image processing techniques. In most of the cases disease symptoms are seen on the leaves, stem and fruit. The early detection of diseases on plants is really required as a very small number of diseased crops can spread the infection to the whole batch of fruits and vegetables and thus affects further storage and sales of agriculture products. This effect of plant diseases are very destructive as a lot of farmers were discouraged to the point where some decided to give up the work of crop cultivation. II. OVERVIEW OF SYSTEM The basic purpose for developing this system is crop disease prediction using various data mining techniques. Our project describes a new approach to crop disease prediction which helps to prevent future economical losses. This project emphasizes on every single concept related to crop diseases. This is accomplished by building a web platform in which farmers can interact with expert, share their experiences and knowledge. In Agriculture sector plants or crop cultivation have seen fast development in both the quality and quantity of food production, however, the presence of pests and diseases on crops especially on leaves has hindered the quality of agricultural goods. There is therefore a need to identify these diseases at an early or superior stage and suggest solutions so that maximum harms can be avoided to increase crop yields. System Modules: Test data acquisition module: Test data acquisition module is test data acquisition or Data selection as the process of determining the appropriate data type and source, as well as suitable instruments to collect data. Data selection precedes the actual practice of data collection. This definition distinguishes data selection from selective data reporting (selectively excluding data that is not supportive of a

All rights reserved by www.ijirst.org

144


Developing an Agricultural Portal for Crop Disease Prediction (IJIRST/ Volume 3 / Issue 11/ 025)

research hypothesis) and interactive/active data selection (using collected data for monitoring activities/events, or conducting secondary data analyses). The process of selecting suitable data for a research project can impact data integrity. The two primary data types are quantitative (represented as numerical figures - interval and ratio level measurements), and qualitative (text, images, audio/video, etc.). Data pre-processing module: Data preprocessing describes any type of processing performed on raw data to prepare it for another processing procedure. Commonly used as a preliminary data mining practice, data preprocessing transforms the data into a format that will be more easily and effectively processed for the purpose of the user. In order to improve the quality of the data and consequently, of the mining results so as to improve efficiency and ease of mining process, the data is preprocessed Image comparison module : All the images of crops affected by the disease and those crops not affected by diseases are stored in the database. In this case CBIR[11] is used, CBIR uses three steps to extract the properties from image on the basis of colour, texture, shape and properties or features are stored in the Database.When the user uses his android phone and gets a snap of the image, the features of query image are first extracted using CBIR. The images in the database are first grouped with respect to the features of images like colour, texture, shape. Features which are extracted from the query image are compared with the clusters and a set of images are produced.Iimages matching nearest to the query image are generated. Finally, result is returned to the user. Data mining module : Data mining is the process of extracting valid information from large databases. Data mining used in many research areas. Data mining is widely used in agricultural data processing. There are many techniques of Data mining which are developed for decision making (DSS). If proper decision making techniques are applied on data then these data are stored in data bases. These data are used to understand the hidden correlation between crop-pest and disease parameters . Result analysis And testing : Plant disease forecasting models must be thoroughly tested and validated after being developed. Interest has arisen lately in model validation through the quantification of the economic costs of false positives and false negatives, where disease prevention measures may be used when unnecessary or not applied when needed respectively. The costs of these two types of errors need to be weighed carefully before deciding to use a disease forecasting system. III. SYSTEM DESIGN

Fig. 1: System flow diagram

This system provides an interactive portal where farmer needs to register his details. After successful login he will be given access to multiple options like upload image, enter crop details and contact expert. User can upload single or multiple images of affected crop . The images uploaded by user will be compared with images present in database using CBIR( Content Based Image Retrieval ) Technique. The user can enter crop details like soil quality, sunshine hours, temperature, humidity, rain-fall,

All rights reserved by www.ijirst.org

145


Developing an Agricultural Portal for Crop Disease Prediction (IJIRST/ Volume 3 / Issue 11/ 025)

pesticides, fertilizers, seed variety, area .These crop details will be processed and used to predict crop disease using Naive-Bayes Classification algorithm. User can also contact to expert for seeking advice and to solve his queries and get necessary solutions. Pseudo Code for image comparison: if (sizeA == sizeB) { for (int i = 0; i < sizeA; i++) { if (dbA.getElem(i) == dbB.getElem(i)) { count = count + 1; } } IV. IMPLEMENTATION We are using two techniques for implementation: 1) CBIR for image comparison 2) Naive Bayes Classification for data mining Content Based Image Retrieval (CBIR) Techniques - Image Comparison CBIR techniques works in a different way from the traditional text base system. Images stored in a cluster are extracted and then features of those images are used for comparison. Features of the images are automatically extracted. CBIR System mainly use color, shape and texture as their basic features so they operate on level one which is at low level. Traditional system takes query from user by taking input as image while some systems offer extra options to the user such as palette or sketch input. Next step system compares the query image with stored image and whose feature values matches closely, those images are shown to the user. Main retrieval types for image retrieval are explained below. Color Retrieval When an image is used for comparison, it is first processed and colour histogram is derived from the image, it shows the proportion of colours in each pixel of the image. Colour histogram derived from image is stored in database. Images whose colour histogram matches closely with query image are considered. Images whose colour histogram matches closely with query image are considered. Texture Retrieval Texture similarity is done by calculating values from query and image stored in database. Parameter of adaptive brightness from the pair of pixel of two images is considered for comparison. Values are calculated on the basis of scale, degree of contrast, directionality, periodicity for texture analysis.Texture equality can be used for differentiation between colours and area of images. Texture comparison is done by submitting query image or by selecting texture from palette. System then considers those image whose texture measures match closely with query image. Shape Retrieval Third technique is to obtain the information on the image using shape retrieval .Basic requirement to retrieve the property of image is by their shape which is at the basic level of shape retrieval technique.Two important types of shape feature are usually used like global features as aspect ratio, circularity and moment invariants and local features as sets of successive boundary segments. The Queries are answered by processing the equivalent features for the query image and obtaining those images whose features nearly match with query image. NAIVE BAYES CLASSIFICATION – FOR DATA MINING It is a classification technique based on Bayes’ Theorem with an assumption of independence among predictors. In simple terms, a Naive Bayes classifier assumes that the presence of a particular feature in a class is unrelated to the presence of any other feature.Naive Bayes model is easy to build and particularly useful for very large data sets. Along with simplicity, Naive Bayes is known to outperform even highly sophisticated classification methods. Bayes’ Theorem is stated as : P(h|d) = (P(d|h) * P(h)) / P(d) Where  P(h|d) is the probability of hypothesis h given the data d. This is called the posterior probability.  P(d|h) is the probability of data d given that the hypothesis h was true.  P(h) is the probability of hypothesis h being true (regardless of the data). This is called the prior probability of h.  P(d) is the probability of the data (regardless of the hypothesis).

All rights reserved by www.ijirst.org

146


Developing an Agricultural Portal for Crop Disease Prediction (IJIRST/ Volume 3 / Issue 11/ 025)

V. CONCLUSION The early detection of diseases on plants is much required as a very small number of diseased crops can spread the infection to the whole crop in the field and thus affects further storage and sales of agriculture products.This system can be used for multiple purposes where agro based industries can launch their products and acquire feedbacks. Agricultural institutes can explore new inventions and technologies for farmers. This system will help to increase income from overall crop production. This system will be an interactive one which helps to overcome communication barrier through interaction between farmers and experts.The system reduces ocurrences of diseases in crops and thus prevents future economical losses. ACKNOWLEDGMENT We wish to express our sincere gratitude to Prof. Prashant Sawant, Project Guide for providing us an opportunity to do our project work in Agricultural domain. We sincerely thank Mr. Uday Rote, HOD of IT Department and Mr. Harsh Bhor, Project Co-ordinator for their guidance and encouragement in carrying out this project work. We also wish to express our gratitude to the officials and other staff members of K.J Somaiya Institute of Engineering and Information Technology, who rendered their help during the period of our project work. REFERENCES [1] [2] [3] [4] [5] [6] [7] [8]

Cios K J,Pedrycz W,Swiniarsk R M.Data Mining methods for knowledge discovery.[J].IEEE Trans Neural Netw.1998,9(6): 15331534 Turechek W W. Nonparametric Test in plant disease epidemiology : Characterizing disease association [J]. Phytopathology . 2004 , 94(9) :1018-1021 K.Srinivas,” Mining Association Rules from Large Datasets towards Disease Prediction”, 2012International Conference on Information and Computer Networks (ICICN 2012) IPCSIT. S.Mahendra Dev,General Arun Kumar “Small Farmers in India:ChallengesandOpportunities”.http://www.igidr.ac.in/pdf/publication/WP-2012-014.pdf Xu X M, Ridout M S. Effects of quadrat size and shape, initial epidemic conditions, and spore dispersal gradient on spatial statistics of plant disease epidemics.[J]. Phytopathology. 2000, 90(7): 738-750. Fayyad U M, Piatetsky-Shapiro G, Smyth P. From data mining to knowledge discovery: an overview [M]. Advances in knowledge discovery and data mining , American Association for Artificial Intelligence, 1996, 1-34. Poorani, Prathiba, Ravindran, “Integrated Feature Extraction for Image Retrieval”, IJCSMC Vol. 2, Issue. 2, February2013. M. H. Saad, H. I. Saleh, H. Konbor, M. Ashour, “Image Retrieval based on Integration between Color and Geometric Moment Features”, Arab Journal of Nuclear Sciences and Applications45(2)447-454(2012), 2012.

All rights reserved by www.ijirst.org

147


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.