Online ISSN: 2515-8260

Keywords : Data Mining


Using Web Scraping In A Knowledge Environment To Build Ontologies Using Python And Scrapy

M. El Asikri; S. Krit; H. Chaib

European Journal of Molecular & Clinical Medicine, 2020, Volume 7, Issue 3, Pages 433-442

Web scraping, or web data extraction is data scraping used for extracting data from websites. Web scraping software may access the World Wide Web directly using the Hypertext Transfer Protocol, or through a web browser. While web scraping can be done manually by a software user, the term typically refers to automated processes implemented using a bot or web crawler. It is a form of copying, in which specific data is gathered and copied from the web, typically into a central local database or spreadsheet, for later retrieval or analysis.
In this paper, among others kind of scraping, we focus on those techniques that extract the content of a Web page. In particular, we adopt scraping techniques in the Web e-commerce field. To this end, we propose a solution aimed at analyzing data extraction to exploiting Web scraping using python and scrapy framework .

A REAL AND ACCURATE FAKE PRODUCT DETECTION SYSTEM AND GENERATE ORIGINAL REVIEWS USING DATA MINING MECHANISM

VVS SASANK; Praveen S R Konduri; Prasanna Kumar Prathipati

European Journal of Molecular & Clinical Medicine, 2020, Volume 7, Issue 4, Pages 952-959

Most of the people requires genuine information about the online product. Before spending their economy on particular product can analyze the various reviews in the website. In this scenario, they did not identify whether it may be fake or genuine. In general, some reports in the websites are good, company technical people itself add these for making the product famous. These people belong to media and social organization teams, they give reviews with a good rating by their own firm. Online purchasers did not identify the fake product because of this falsification in the reviews of the website. In this research,the SVM classification mechanism has been used for detect the fake reviews by using IP address. This implementation helpful for users find out the correct review of online product. In this accuracy is improved by 98.79%, F1 score increases by 10%.

Development of Top K-Association Rule Mining for Discovering pattern in Medical Dataset

Aakriti Sharma; Anjana Sangwan; Blessy Thankchan; Sachin Jain; Veenita Singh; Shantanu Saurabh

European Journal of Molecular & Clinical Medicine, 2020, Volume 7, Issue 4, Pages 1413-1421

Association rules consist of the discovery of association between mining transaction items. This is one of the most important information mining jobs. It has been integrated into many commercial data mining software and has a wide variety of applications on a number of domains. So, computing the prediction rules in top rank data set is very difficult task. Finding the pattern in large data set require memory computational power high rate of I/O. and it is possible only on high computational machine. In this paper, selection of parameter which is used to compute is chosen based on minimum support and minimum confidence value. In this paper proposed a new algorithm which generates the association rule for the input parameters to finding the pattern in large data set. The algorithm starts searching the rules. As soon as a rule is found, it is added to the list of order rules list by support. The list is used so far to maintain top N rules found. Once valid rules are found, the minimum support for the internal minsup variable list is raised to support the rule. When the Minsup value is raised, the search space is robbed while searching for more rules. Then, every time a valid rule is found, the list is inserted into the list, the lists that are not listed in the list are excluded from the list and the minsup is raised for the price of the least fun rules in the list. Result shows that new method is efficient technique to mine data set from standard data with minimum configuration system.