Ask Question, Ask an Expert

+61-413 786 465

info@mywordsolution.com

Ask Management Information System Expert

Task steps:

1. Create an author-to-author tweet edge file from the original data set, stocktwit_graph_input.csv.

Create an edge file from the original data set, stocktwit_graph_input.csv. We just need two columns - source (Vertex 1) and target (Vertex

2) of an edge to create a graph. Select all rows - tweets for columns K- "from_person" and M - "to_person" (or J and L for numerical author IDs) and save it as "stocktwit_from_to" or another name you prefer.

2. Use Gephi to generate and save author (node) metrics. Select the metrics you like to explore and use for building models later. Include at least 5 different metrics.

a. Which three authors have the highest betweenness centrality?

b. Which three authors have the highest total degree?

c. Which three authors have the highest closeness?

3. Build the Node Table for Prediction

(1). Open the stocktwit_node.csv file in Excel, and create a new variable: Expert (i.e. suggested). It is the target variables we aim to classify or predict.

(2). Do not close the stocktwit_node.csv file. Open the stocktwit_graph_input.csv file. And then go to the stocktwit_node.csv.

(3). Note that the unit in the stocktwit_node.csv file is a node (i.e. each individual author) and the unit in the stocktwit_graph_input.csv file is a tweet (i.e. each message). So, in order to transfer the value of expert from the table of stocktwit_graph_input to the stocktwit_node table, we need to do data transformation.

To Expert, we need to assign one value to one author (i.e. whether they are expert or not - 1 stands for yes; 0 stands for no.).

Use the VLOOKUP function to assign the value of "suggested" from the table of stocktwit_graph_input to the column, "Expert", in stocktwit_node table. The function for the first row should be like this:

= VLOOKUP(A2, stocktwit_graph_input.csv!$K$1:$AB$38200,18,FALSE),

where "A2" is the node name; "stocktwit_graph_input.csv!$K$1:$AB$38200" is the table range we look up; 18 is the column number from the table range that we aim to return the value, "FALSE" stands for an exact match of the value.

(4). Save the stocktwit_node.csv file. BTW, you can delete those rows who have missing value in Expert, because these nodes only appear in the "to_person" column, they do not have tweets.

Use filter function in excel to remove the #NAs.

4. In R, build and evaluate a classification model that uses the metrics in stocktwit_node_yourname.csv from step 2 as features to classify authors into "expert" stocktwit author (i.e., "suggested"=1)" or not ("suggested"=0) which is the target label variable.

(1). Using a seed of 100, randomly select 60% of the rows into training (e.g. called traindata). Divide the other 40% of the rows evenly into two holdout test/validation sets (e.g., called testdata1 and testdata2).

(2). Build the tree using the C50 function with default settings.

(3). Generate predictions (i.e. estimations) of the values of the target variable for the testing instances.

Generate a confusion matrix that shows the counts of true-positive, true-negative, false-positive and false-negative predictions for both testdata1 and testdata2. Consider 1 as positive class.

Generate seven performance metrics - Accuracy (percent of all correctly classified testing instances), and precision (percent of instances predicted to have a class are accurate), recall (also true positive) and F-measure (also F-score) of the two classes of expert.

(4). Would you recommend using the features from network analysis to identify experts in the Stocktwit community? Why or why not?

Attachment:- stocktwit_graph_input.rar

Management Information System, Management Studies

  • Category:- Management Information System
  • Reference No.:- M92577543
  • Price:- $100

Priced at Now at $100, Verified Solution

Have any Question?


Related Questions in Management Information System

Knowledge management systems and crmin answer to the

KNOWLEDGE MANAGEMENT SYSTEMS AND CRM In answer to the challenges Nelnet faces in servicing a growing volume of student loans, the company chose to deploy a knowledge management system called OpenText Process Suite. Go on ...

Assignment the need for wireless standards and

Assignment : The Need for Wireless Standards and Protocols The networking field, to include wireless networking, defines many standards to govern network and wireless network operations. It is important to become familia ...

Background kirk 2016 designed his text to help understand

Background: Kirk (2016) designed his text to help understand the four steps involved in working with data. Kirk (2016) Discuss the following working with data steps: Data acquisition, data examination, data transformatio ...

Ransomwareto pay or not to pay when it comes to corporate

Ransomware: To pay or not to pay? When it comes to corporate data, should corporations pay? Can you trust paying? What can be done to protect against ransomware? Would you pay if it were your own personal data? How can y ...

1-consider how deming and tqm would have dealt with the

1- Consider how Deming and TQM would have dealt with the problems at Boeing () 2 - What Does a TQM initiative look like in an IT dept? 3 - How would IT support total quality at Boeing? (can summarize these above 3 questi ...

Part 1 - create an 8 slide powerpoint presentation on

Part 1 - Create an 8 slide PowerPoint presentation on foundational concepts specific to physical security. Part 2 - Write 4 pages detailing the framework for the design of an integrated data center. Assessment Instructio ...

Question what is the resolution and unification algorithm

Question : What is the resolution and unification algorithm, and what is an algorithm? What is the Turing test, and who is Alan Turing? What is a neural network? Can machines really demonstrate intelligence?

In roughly 200 words -discuss how the roles and functions

In roughly 200 words - Discuss how the roles and functions of IS governance are changing or should change, as a company considers Cloud and Big Data migrations (Hints: focus on information quality, information systems an ...

Background kirk 2016 designed his text to help understand

Background: Kirk (2016) designed his text to help understand the four steps involved in working with data. Kirk (2016) discusses the following working with data steps: Data acquisition, data examination, data transformat ...

Write a 2 page paper that discusses what policies were

Write a 2 page paper that discusses what policies were missing in the particular case. Do additional research than what was provided in the text. Use APA format Cite your sources. •Private Sector •Target Corporation •1,7 ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As