subject

Requirements : you are to create a program in python that performs the following using the pandas packages: 1. loads the ss13hil. csv file that contains the pums dataset (assume it's in the current directory) and create a dataframe object from it.2. create 3 tables: table 1: statistics of hincp - household income (past 12 months), grouped by hht - household/family type table should use the hht types (text descriptions) as the index columns should be: mean, std, count, min, max rows should be sorted by the mean column value in descending ordertable 2: hhl - household language vs. access - access to the internet (frequency table) table should use the hhl types (text descriptions) as the index columns should the text descriptions of access values each table entry is the sum of wgtp column for the given hhl/access combination, divided by the sum of wgtp values in the data. entries need to be formatted as percentages. table should include marginal values ('all' row and column). any rows containing na values in hhl, access, or wgtp columns should be excluded. table 3: quantile analysis of hincp - household income (past 12 months) rows should correspond to different quantiles of hincp: low (0-1/3), medium (1/3-2/3), high (2/3-1) columns displayed should be: min, max, mean, household_count the household_count column contains entries with the sum of wgtp values for the corresponding range of hincp values (low, medium, or high)3. display the tables to the screen as shown in the sample output on the last page. additional requirements1. the name of your source code file should be tables. py. all your code should be within a single file. 2. you need to use the pandas dataframe object for storing and manipulating data.3. your code should follow good coding practices, including good use of whitespace and use of both inline and block comments.4. you need to use meaningful identifier names that conform to standard naming conventions.5. at the top of each file, you need to put in a block comment with the following information: your name, date, course name, semester, and assignment name.6. the output should exactly match the sample output shown on the last page. what to turn inyou will turn in the single tables. py filehints to get the right output, use the following functions to set pandas display parameters: pd. set_option('display. max_columns', 500)pd. set_option('display. width', 1000) to display entries as percentages, use the applymap method, giving it a string conversion function as input. the string conversion function should take a float value v as an input and output a string representing v as a percentage. to do this, you can use formatting strings or the format() method

ansver
Answers: 3

Other questions on the subject: Computers and Technology

image
Computers and Technology, 22.06.2019 22:00, delawdermia27
Researchers measured the data speeds for a particular smartphone carrier at 50 airports. the highest speed measured was 78.1 mbps. the complete list of 50 data speeds has a mean of x overbarequals16.11 mbps and a standard deviation of sequals18.65 mbps. a. what is the difference between carrier's highest data speed and the mean of all 50 data speeds? b. how many standard deviations is that [the difference found in part (a)]? c. convert the carrier's highest data speed to a z score. d. if we consider data speeds that convert to z scores between minus2 and 2 to be neither significantly low nor significantly high, is the carrier's highest data speed significant? a. the difference is nothing mbps.
Answers: 3
image
Computers and Technology, 22.06.2019 23:30, Nason156
What are some ways to use a range name in a formula? check all that apply. in the defined names group, click use in formula, and then select the desired name. begin typing the name in the formula, select a name from the autocomplete list, and use the arrow keys and tab key to enter the name in the formula. begin typing the formula, and then click and drag with the mouse to select the cells to include in the formula. right-click one of the cells in the range. click formula options, and use the dialog box to add the name.
Answers: 1
image
Computers and Technology, 23.06.2019 10:50, whyidkmyself
Your friend kayla is starting her own business and asks you whether she should set it up as a p2p network or as a client-server network. list three questions you might ask to kayla decide which network to use and how her answers to those questions would affect your recommendation.
Answers: 2
image
Computers and Technology, 24.06.2019 16:30, officialrogerfp3gf2s
Pressing the backspace key deletes the text to the of the insertion point. the left or the right?
Answers: 1
You know the right answer?
Requirements : you are to create a program in python that performs the following using the pandas pa...

Questions in other subjects:

Konu
Mathematics, 27.01.2021 19:40
Konu
Mathematics, 27.01.2021 19:40