Python代写:ITD104Fantales


Introduction

这次代写的web作业,是用HTML, Python, SQLite做几个页面,展示数据库中的数据。环境搭好的话,代码不算太难。

Overview

This task requires you to use several of the technologies introduced this
semester: programming (Python), database queries (SQLite) and web development
(HTML). These same skills, which you will acquire by completing this task at
home, will subsequently be tested under time pressure in the “in-class test.”

Goal

This task involves writing a Python program which interrogates an SQLite
database and generates HTML documents as its output. It is thus an excellent
example of the main theme of ITD104, i.e., the way different “computer
languages”
can be used together to build new IT systems.
The aim of this task is to develop a program which creates HTML documents that
visualise movie actors’ popularity amongst their fans, according to a survey
of movie customers stored in a data-base. More specifically, given a
particular category of movie fans, for example, females or customers whose age
is between 30 to 40, we want to know how popular the actors stored in the
movie database are with this group. The required output is a set of HTML
pages, one per customer group. Each page must contain a visual representation
of the actors’ popularity in which the more popular actors names are displayed
in a bigger font.

The Database

An SQLite database backup file or ‘dump’ called movie_survey.sql containing
the data to use for this task has been supplied. It contains results from a
survey taken by Microsoft employees in 2002 about their favourite actors and
movies. In order to complete this task, you need to first recreate this
database by running the supplied script in the SQLite DB Browser. Please refer
to the exercises in the Week 7 workshop for how to restore a database from a
database dump.
There are four tables in this database, actors, customers, actors_movies and
favorite_actors. Among them, two tables are needed for this task, customers
and favorite_actors. The definition of these two tables is explained below.

Table customers

Table customers provides information about movie patrons or customers. Each
row in the table consists of seven fields as illustrated below. Field
customerID is the identifier of a customer and is the primary key. Fields two
to seven provide the customer’s personal information including age, education
level, gender, marital status, number of children and number of cars. The
fields are defined by the schema below:

name type
customerID INT(6)
age INT(11)
education_level VARCHAR(30)
gender VARCHAR(10)
marital_status VARCHAR(20)
number_of_cars INT(11)
number_of_children INT(11)

Table favorite_actors

Table favorite_actors provides information about movie fans’ favourite actors.
Each row in the table consists of three fields as illustrated below. ID is the
identifier of this record, customerID is the identifier of a customer, and
actor is the name of an actor who is one of this fan’s favourites. The fields
are defined by the schema below:

name type
ID INT(6)
customerID INT(6)
actor VARCHAR(100)
Notice that the format of actor names in this database is “given_name surname”
as shown in the tables above. When you order and display the names as required
in this task, you can treat each name “given_name surname” as one string
without separately dealing with given names and surnames. (This means that the
names will be sorted on the actors’ given names.)

General Requirements

The aim of this task is to visually show the popularity of actors among
specific categories of movie fans. You are required to develop a Python
program that accesses the SQL database described above to generate HTML
documents. The user of your program specifies:

  • How many results to display; and
  • Which categories of movie fans’ opinions to display.
    The result is a collection of hyperlinked HTML documents, one per category of
    movie customer. The different categories of customer that the user can specify
    are:
  • Male fans;
  • Female fans;
  • Fans in a certain age range, e.g., 30 to 40, inclusive; and
  • All fans.
    An example of one of the HTML documents is shown in Figure 1 on the following
    page. It shows the popularity of actors with movie customers with an age
    between 30 and 40, inclusive. (This page is one of those to be produced by
    Test 2 in the provided template file fantales.py.) The popularity of an actor
    is defined by the number of customers who like this actor.
    The idea is that more popular actors are displayed in larger fonts. For
    instance, Tom Hanks is more popular than Sandra Bullock because Tom Hanks has
    420 fans in this age range, while Sandra Bullock has only 130. In Figure 1,
    the font used for Tom Hanks is obviously larger than that of Sandra Bullock.
    Each actor’s name is also followed by the number of movie customers who like
    this actor, in a smaller, unintrusive font. For instance, in Figure 1, Tom
    Hanks is associated with number 420 because he is a favourite actor to 420
    customers.
    In this task, we also use different colours to display actors whose
    popularities are different. Actors with the same popularity are displayed in
    the same font size and also the same colour. For example, Dustin Hoffman and
    Nicole Kidman in the example below have the same font size and colour since
    both of them are popular with 88 fans. Actors with different popularities are
    shown in a different font size and also a different colour, e.g., Tom Hanks
    and Sandra Bullock in the example below have a different font size and also a
    different colour. (A possible refinement of the task is to relate the colours,
    so that more popular actors get a stronger colour, but you are not required to
    do so.)
    Your job is to develop a Python program which generates such pages using the
    data in the SQL database movie_survey. As illustrated in the Python template
    file accompanying these instructions, you must define a function called
    show_popularity which, given a list of customer categories, produces HTML
    files that can be displayed in a web browser, one per customer category.
    Another feature of the document shown in Figure 1 is that on the top of the
    page you must display the category of the customers, e.g., aged “30-40” in
    this case, and the number of customers in this group, e.g., 1549 above.
    In addition, your program must hyperlink each of the pages together, to make
    it easy for the user to search through them. In Figure 1 above we can see
    hyperlinks at the bottom of the page to the previous and next categories in
    the list. However, no ‘previous’ link is produced for the first category in a
    user’s list and no ‘next’ link is produced for the last page in a list.
    For example, if the given list of customer groups is [‘Female’, ‘Male’,
    ‘30-40’] then your function must generate three HTML pages, one for female
    customers, one for male customers, and one for customers aged between 30 and
    40, inclusive. In the male movie customers’ page, the ‘previous’ link and the
    ‘next’ link should take us to the female customers’ page and the page for
    customers aged between 30 and 40, respectively. However, for the female
    customers’ page, there is no ‘previous’ link, and for the page of customers
    aged between 30 and 40 there is no ‘next’ link.
    Figures 2 to 4 below show the expected output when we ask for the top 20
    actors preferred by fans in the category list [‘Female’, ‘Male’, ‘30-40’].
    (These three pages are the output expected for Test 1 in the template file
    fantales.py.) In each case the relative popularity of actors in each of the
    specific customer categories is shown clearly, and each of the three pages is
    hyperlinked together in the order of the list of customer categories.

Specific Tasks

The specific things you must do to complete this portfolio task are as
follows. NB: This is a large task. Although we only require you to produce one
Python function, show_popularity, we recommend that you break your solution
down into several functions to make your program code manageable and readable.
You should use the supplied template file fantales.py as a starting point.

  1. Restore the movie_survey database using the supplied dump file movie_survey.sql
  2. Write a Python function named show_popularity that has three parameters: (a) a list of strings representing a list of customer groups, (b) an integer value representing how many actors are to be displayed on the page, and (c) a string which gives a name to this particular multi-page document.
    For example, if we execute function call show_popularity([‘20-40’, ‘40-80’,
    ‘All’], 30, ‘Test03’) the first parameter says that we want three pages
    created, one for movie customers aged 20 to 40, inclusive, one for movie
    customers aged 40 to 80, and one for all movie customers. (Other valid
    customer categories are Female and Male.)
    The second parameter is an integer value which specifies how many actors’
    names to display on each page. Those displayed must be the most popular actors
    for that particular customer group.
    The third parameter is simply a name used to uniquely identify this collection
    of HTML pages. You should use it together with the customer category to create
    the name for the HTML files generated. In this particular example your program
    must produce the following three HTML files:
    Test03_20-40.html
    Test03_40-80.html
    Test03_All.html
    Actors must be displayed in alphabetical order regardless of their popularity.
    As mentioned before, each actor’s name in the database consists of a given
    name and a surname in the format of “given_name surname”, so just sort them as
    a single string.
    For example, in Figure 1, Al Pacino, Anthony Hopkins, and Arnold
    Schwarzenegger are displayed in that order, even though Arnold Schwarzenegger
    has a higher popularity than Anthony Hopkins. Each actor’s name must be
    followed with the actual number of fans derived from the database. Ensure that
    the numbers produced by your database query match those in the figures above.
    At the bottom of each page, you must have hyperlinks to the previous page and
    next page in the list, if any.
  3. Some queries will produce no results at all. In this case you should still produce an HTML page, even if there are no actors listed. An example is showed in Figure 6 below (which is one of the pages produced by Test 10 in the provided template file).
  4. As noted above, the valid customer categories are Female, Male, All and an age range. If any other category is found in the list provided as the first parameter to show_popularity a message should be printed noting the error and this item should be ignored. Pages must still be produced for all valid cases in the list, however. See the unit tests in the provided template file for examples.

Development hints

  • Before you can begin this task you must ensure that you have access to SQLite DB Browser so that you can create the database. You must also have access to the necessary SQLite-Python module so that you can call SQLite functions from Python code.
  • Given a movie customer category you will need to execute two queries on the database, one to find out how many customers there are in this category and one to get the list of actors and their popularity. The first query is relatively easy and involves the customers table only. The second involves a join of both the customers and favorite_actors tables. We recommended debugging your database queries in the DB Browser before attempting to incorporate them into your Python program.
  • Use the screenshots shown above to guide your solution, but you do not need to duplicate the precise choices of font sizes and colours. Importantly, however, your SQL queries must return exactly the same number of fans as shown in the examples above.
  • This is a large program, so it is suggested that you tackle it incrementally. Before you even begin coding you should use the DB Browser to determine exactly what query your Python program needs to execute. Then develop code to generate just one HTML page. Finally, write the code to iterate over a list of customer groups to generate all documents.
  • You need to work out how to define the font size and colour for the text to be displayed on web pages. This can be done by changing the font’s ‘style’. Refer to the lecture demonstrations and workshop exercises for how to do this.

Deliverables

You should develop your solution by completing and submitting the following
which must be contained in a zip file:

  1. Your fantales.py program, which must be an edited version of the supplied fantales.py template. This file must contain a function show_popularity which has all the characteristics defined above. When executed in the presence of an appropriately configured SQLite database, this program must produce the required HTML files.
    Complete the “statement” at the beginning of the Python file to confirm that
    this is your own individual work by inserting your name and student number in
    the places indicated. We will assume that submissions without a completed
    statement are not your own work and they will not be marked.
  2. A folder containing a complete set of generated HTML files for the supplied tests at the beginning of the template file. This will be used by the marker to confirm that your solution works in the (unlikely) event that the marker has trouble running your code.
    You do not need to submit any SQL database scripts or dumps. We will use our
    own SQL database to test your software.
    Apart from working correctly your program code must be well-presented and easy
    to understand, thanks to (sparse) commenting that explains the purpose of
    significant code segments and helpful choices of variable and function names.
    Professional presentation of your code will be taken into account when marking
    this task. See the Code Marking Guide on Blackboard under Assessment for
    details.
    If you are unable to solve the whole problem, submit whatever parts you can
    get working. You will receive partial marks for incomplete solutions.

文章作者: SafePoker
版权声明: 本博客所有文章除特別声明外,均采用 CC BY 4.0 许可协议。转载请注明来源 SafePoker !
  目录