Earn Higher Grades With Instant Assignment Help.Ask Question!

Python Programming
(5/5)

this assignment is to scrape consumer reviews from a set of web pages and evaluate the performance of text classification on the data.

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

The objective of this assignment is to scrape consumer reviews from a set of web pages and evaluate the performance of text classification on the data. The reviews have been divided into five categories here:

http://mlg.ucd.ie/modules/yalp

 

Each review has a star rating. For this assignment, we will assume that 1-star to 3-star reviews are “negative”, and 4-star to 5-star reviews as “positive”.

The assignment should be implemented as a single Jupyter Notebook (not a script). Your notebook should be clearly documented, using comments and Markdown cells to explain the code and results. The assignment can be completed either individually or in pairs.

 

Tasks:

In this assignment you should complete all of the following tasks:

Select two review categories of your choice. Scrape all reviews for each category and store them as two separate datasets. For each review, you should store the review text and a class label (i.e. whether the review is “positive” or “negative”). 

 

For both category datasets: 

From the reviews in this category, apply appropriate preprocessing steps to create a numeric representation of the data, suitable for classification.

Build a classification model using a classifier of your choice, to distinguish between “positive” and “negative” reviews.

Test the predictions of the classification model using an appropriate evaluation strategy. Report and discuss the evaluation results in your notebook.

Evaluate how well your two classification models transfer between category. That is, run experiments to:

Train a classification model on the data from “Category A”, and evaluate its performance on the data from “Category B”.

Train a classification model on the data from “Category B”, and evaluate its performance on the data from “Category A”.

 

Guidelines:

The assignment can be completed either individually or in pairs. Any evidence of plagiarism will result in a 0 grade.

For the assignment, only these third-party packages can be used: NumPy, Pandas, Scikit-learn, NLTK, SciPy, Requests, BeautifulSoup, Matplotlib, Seaborn, Gensim.

Attachments:
(5/5)

Related Questions

CSI 1420 Introduction to C Programming & Unix Fall 2018, CRN 44882, Oakland University Homework Assignment 6 - Using Arrays and Functions in C

DescriptionIn this final assignment, the students will demonstrate their ability to apply two majorconstructs of the C programming language – Fu

The standard path finding involves finding the (shortest) path from an origin to a destination, typically on a map. This is an

Path finding involves finding a path from A to B. Typically we want the path to have certain properties,such as being the shortest or to avoid going t

Develop a program to emulate a purchase transaction at a retail store. This program will have two classes, a LineItem class and a Transaction class. The LineItem class will represent an individual

Develop a program to emulate a purchase transaction at a retail store. Thisprogram will have two classes, a LineItem class and a Transaction class. Th

SeaPort Project series For this set of projects for the course, we wish to simulate some of the aspects of a number of Sea Ports. Here are the classes and their instance variables we wish to define:

1 Project 1 Introduction - the SeaPort Project series For this set of projects for the course, we wish to simulate some of the aspects of a number of

Project 2 Introduction - the SeaPort Project series For this set of projects for the course, we wish to simulate some of the aspects of a number of Sea Ports. Here are the classes and their instance variables we wish to define:

1 Project 2 Introduction - the SeaPort Project series For this set of projects for the course, we wish to simulate some of the aspects of a number of

Ask This Assignment To Be Done By Our ExpertsGet A+ Grade Solution Guaranteed

expert
joyComputer science
(4/5)
12 Answers Hire Me
expert
Robert DLaw
(4.8/5)
859 Answers Hire Me
expert
Dr Samuel BarberaStatistics
(5/5)
595 Answers Hire Me
expert
Tutor For YouEconomics
(5/5)
902 Answers Hire Me