
Assignment

1 Lalonde NSW Data

A. Load the Lalonde experimental dataset with the lalonde_data method from the module causalinference.utils. The outcome variable is earnings in 1978, and the covariates are, in order:

Black       Indicator variable; 1 if Black, 0 otherwise.
Hispanic    Indicator variable; 1 if Hispanic, 0 otherwise.
Age         Age in years.
Married     Marital status; 1 if married, 0 otherwise.
Nodegree    Indicator variable; 1 if no degree, 0 otherwise.
Education   Years of education.
E74         Earnings in 1974.
U74         Unemployment status in 1974; 1 if unemployed, 0 otherwise.
E75         Earnings in 1975.
U75         Unemployment status in 1975; 1 if unemployed, 0 otherwise.

Using CausalModel from the module causalinference, provide summary statistics for the outcome variable and the covariates. Which covariate has the largest normalized difference?
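For reference, the normalized difference of a covariate is commonly defined as the difference between the treated and control group means, divided by the square root of the average of the two group variances. The following is a minimal sketch of that formula in plain Python. The function name normalized_difference and the toy values are assumptions made for illustration; they are not part of causalinference and are not taken from the Lalonde data.

```python
from math import sqrt
from statistics import mean, variance


def normalized_difference(treated, control):
    """(mean_t - mean_c) / sqrt((var_t + var_c) / 2), using sample variances."""
    pooled = (variance(treated) + variance(control)) / 2
    return (mean(treated) - mean(control)) / sqrt(pooled)


# Toy covariate values (illustrative only, not from the Lalonde data).
age_treated = [22, 25, 27, 30, 26]
age_control = [24, 28, 31, 35, 32]

nd = normalized_difference(age_treated, age_control)
print(round(nd, 3))
```

A covariate with a large absolute normalized difference is one on which the treated and control groups are poorly balanced.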

B. Estimate the propensity score using the selection algorithm est_propensity_s. In selecting the basic covariate set, specify E74, U74, E75, and U75. What additional linear terms and second-order terms does the algorithm select?

C. Trim the sample using trim_s to get rid of observations with extreme propensity score values. What cut-off is selected? How many observations are dropped as a result?
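Conceptually, trimming keeps only the units whose estimated propensity score lies in the interval [cutoff, 1 - cutoff]. The sketch below shows that filtering step in plain Python with a fixed cutoff of 0.1, which is an assumption chosen for illustration. It does not reproduce the data-driven cut-off selection that the exercise asks about, and the scores are hypothetical values rather than estimates from the Lalonde data.

```python
def trim(scores, cutoff):
    """Return indices of units whose score lies in [cutoff, 1 - cutoff]."""
    return [i for i, p in enumerate(scores) if cutoff <= p <= 1 - cutoff]


# Hypothetical propensity scores (not estimated from the Lalonde data).
pscores = [0.03, 0.12, 0.48, 0.55, 0.91, 0.97, 0.35]
cutoff = 0.1  # assumed fixed value for illustration

kept = trim(pscores, cutoff)
dropped = len(pscores) - len(kept)
print(kept, dropped)
```

Comparing the sample size before and after the filter gives the number of dropped observations.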

D. Stratify the sample using stratify_s. How many propensity bins are created? Report the summary statistics for each bin.

E. Estimate the average treatment effect using OLS, blocking, and matching. For matching, set the number of matches to 2 and adjust for bias. How much do the estimates differ?
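To make the idea behind blocking concrete, the sketch below computes a simplified blocking estimate: the treated-minus-control difference in mean outcomes within each propensity bin, averaged with weights proportional to bin size. This is a deliberate simplification. A library implementation may instead run a regression within each block, so the numbers here would not match its output. The function name blocking_ate, the dictionary layout, and the outcomes are all hypothetical.

```python
from statistics import mean


def blocking_ate(blocks):
    """Size-weighted average of within-block treated-minus-control mean outcomes."""
    total = sum(len(b["treated"]) + len(b["control"]) for b in blocks)
    ate = 0.0
    for b in blocks:
        n_block = len(b["treated"]) + len(b["control"])
        diff = mean(b["treated"]) - mean(b["control"])
        ate += (n_block / total) * diff
    return ate


# Two hypothetical propensity bins with toy outcomes.
blocks = [
    {"treated": [10.0, 12.0], "control": [8.0, 9.0, 10.0, 9.0]},
    {"treated": [15.0, 17.0, 16.0], "control": [13.0]},
]
estimate = blocking_ate(blocks)
print(estimate)
```

Because units in the same bin have similar propensity scores, each within-bin comparison is closer to a like-for-like comparison than the raw difference in means over the whole sample.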

2 Document Classification

A. From the module sklearn.datasets, load the training data set using the method fetch_20newsgroups. This dataset comprises around 18,000 newsgroup posts on 20 topics. Print out a couple of sample posts and list all the topic names.

B. Convert the posts (blobs of text) into bag-of-words vectors. What is the dimensionality of these vectors? That is, how many distinct words appear in this data set?
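A bag-of-words vector records how often each vocabulary word occurs in a post, so its dimensionality equals the vocabulary size. The sketch below builds such vectors by hand for two toy posts. The tokenization rule (lowercase, letters only) and the helper names tokenize and bag_of_words are assumptions for illustration; in practice a library vectorizer such as scikit-learn's CountVectorizer would do this work, and its tokenization rules differ, so vocabulary sizes will not match.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and extract runs of ASCII letters as tokens."""
    return re.findall(r"[a-z]+", text.lower())


def bag_of_words(posts):
    """Return (sorted vocabulary, list of count vectors aligned to it)."""
    counts = [Counter(tokenize(p)) for p in posts]
    vocab = sorted(set().union(*counts))
    vectors = [[c[w] for w in vocab] for c in counts]
    return vocab, vectors


# Toy posts standing in for newsgroup messages.
posts = ["The GPU renders the scene.", "Scene lighting needs a GPU."]
vocab, vectors = bag_of_words(posts)
print(len(vocab), vectors)
```

On a real corpus most entries of each vector are zero, which is why library implementations store these vectors in a sparse matrix format.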

C. Use your favorite dimensionality reduction technique to compress these vectors into ones of K = 30 dimensions.

D. Use your favorite supervised learning model to train a model that tries to predict the topic of a post from the vectorized representation of the post you obtained in the previous step.

E. Use the test data to tune your model. Make sure to include K as a hyperparameter as well. Use accuracy_score from sklearn.metrics as your evaluation metric. What is the highest accuracy you are able to achieve?
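The accuracy metric is simply the fraction of posts whose predicted topic equals the true topic. The sketch below computes it by hand on made-up labels to show what the score measures. The function name accuracy and the label values are hypothetical; the exercise itself calls for accuracy_score from sklearn.metrics.

```python
def accuracy(y_true, y_pred):
    """Fraction of positions where the prediction equals the true label."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Made-up topic indices for five posts.
y_true = [0, 3, 3, 7, 1]
y_pred = [0, 3, 2, 7, 7]

acc = accuracy(y_true, y_pred)
print(acc)
```

When tuning, the same score would be computed once per candidate setting (including each candidate value of K), and the setting with the highest score kept.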

Microeconomics, Economics

  • Category:- Microeconomics
  • Reference No.:- M92798066
