Importance of exploratory data analysis of dummy variables, logit/probit using EViews

By IMPRI Team 
 
IMPRI Generation Alpha Data Centre (GenAlphaDC), along with IMPRI Impact and Policy Research Institute, New Delhi, conducted a Two-Week Immersive Online Hands-On Certificate Training Course on Exploratory Data Analysis with Categorical Variables Regression Models: Dummy Variables and Logit/Probit using EViews, on December 10 and 17, 2022. The expert trainer for the course was Professor Nilanjan Banik of Mahindra University. He is also a Visiting Consultant at IMPRI, an Academic Consultant with Geneva Network, United Kingdom, and a Senior Consultant with Hankuk University of Foreign Studies, South Korea.
The convenors for the event were Prof Vibhuti Patel, Visiting Professor at IMPRI and former Professor, Tata Institute of Social Sciences (TISS), Mumbai; Dr Soumyadip Chattopadhyay, Associate Professor, Economics, Visva-Bharati, Santiniketan, and Visiting Senior Fellow, IMPRI; and Dr Arjun Kumar, Director, IMPRI. The course drew participants from the fields of data and policy, including students, professionals, and researchers.

Day 1 | December 10, 2022

The session began with the basics of regression. Professor Banik first asked what a “dummy” means; in essence, he said, it is a replica, a stand-in. In a regression model, a dummy variable X is a qualitative rather than a quantitative variable. He then laid down some assumptions about the dependent and independent variables. A key one is that X and the error term (e) must be unrelated; if they are related, the model suffers from endogeneity. He explained how qualitative traits such as gender and ethnicity can be represented as dummy variables in a regression model. The first use of a dummy variable, then, is to capture the impact of any variable that is qualitative in nature.
Second, he noted that a dummy variable can capture a break or shift in the data. He used the example of the Indian economic reforms of 1991, a breakpoint in per capita GDP levels: after 1991 there was a big jump in GDP growth, that is, a structural break, which a dummy variable can capture. Third, dummy variables can be used to de-seasonalize data. Using Excel, he showed how to incorporate dummy variables in a regression model and why one dummy must be dropped to avoid the dummy variable trap (with an intercept and a dummy for every category, the dummies are perfectly collinear with the intercept). He also showed how to de-seasonalize the data in Excel; after de-seasonalizing, the plotted series was more stable than before.
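The dummy trap and the de-seasonalizing step can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Excel workbook used in the session: the quarterly sales series is synthetic and noise-free, and names such as `seasonal_effect` and `X_trap` are made up for the example.

```python
import numpy as np

# Synthetic quarterly sales: a linear trend plus a fixed effect per quarter.
# All numbers are invented for illustration (assumption, not course data).
n_years = 6
quarter = np.tile(np.arange(4), n_years)            # 0..3 stands for Q1..Q4
trend = np.arange(4 * n_years, dtype=float)
seasonal_effect = np.array([0.0, 5.0, -3.0, 8.0])   # Q1 is the base category
sales = 100.0 + 2.0 * trend + seasonal_effect[quarter]

# One 0/1 dummy column per quarter.
dummies = (quarter[:, None] == np.arange(4)).astype(float)

# Dummy trap: an intercept plus all four dummies is perfectly collinear,
# because the four dummy columns add up to the intercept column.
X_trap = np.column_stack([np.ones_like(trend), trend, dummies])
rank_trap = np.linalg.matrix_rank(X_trap)           # 5, although X_trap has 6 columns

# Fix: drop one dummy (here Q1), which becomes the reference quarter.
X = np.column_stack([np.ones_like(trend), trend, dummies[:, 1:]])
beta, *_ = np.linalg.lstsq(X, sales, rcond=None)

# De-seasonalize: subtract the estimated quarter effects from the series.
deseasonalized = sales - dummies[:, 1:] @ beta[2:]
```

Each coefficient on a retained dummy is read relative to the dropped quarter, and `deseasonalized` is what remains once those quarter effects are removed (here, the intercept plus the trend).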
After the Excel walkthrough, he worked with the same data set in EViews. He selected variables such as sales figures and trend, then created dummy variables for the four quarters. He first ran the regression without the dummies. He then turned to a data set in which he introduced a dummy and ran the regression: US trends in gross personal income and gross personal savings from 1959 to 2007. The dummy variables reflected the recession points from 1981 to 1984, and the resulting regression plot showed the break caused by the recession of 1981. With this the day's session ended, after which Professor Banik took questions and cleared the trainees' doubts. The next class was reserved for Logit and Probit models.
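A structural break of this kind can be sketched as a regression with a break dummy and an interaction term. The Python below is a hedged illustration only: the series are synthetic and noise-free, the break year 1981 is simply borrowed from the recession example, and the specification (shift in both intercept and slope) is an assumption rather than the model estimated in EViews during the session.

```python
import numpy as np

# Synthetic annual data; these are NOT the US income and savings figures.
years = np.arange(1959, 2008)
income = np.linspace(1.0, 10.0, years.size)      # arbitrary units
D = (years >= 1981).astype(float)                # break dummy: 1 from 1981 onward (assumed)

# Savings with an intercept shift (+1.0) and a slope shift (-0.05) after the break.
savings = 0.5 + 0.10 * income + D * (1.0 - 0.05 * income)

# Regress savings on income, the dummy, and the dummy-income interaction.
X = np.column_stack([np.ones_like(income), income, D, D * income])
beta, *_ = np.linalg.lstsq(X, savings, rcond=None)

intercept_shift = beta[2]   # change in the intercept after the break
slope_shift = beta[3]       # change in the slope after the break
```

If the coefficients on `D` and `D * income` are indistinguishable from zero, the data give no evidence of a break at that year; non-zero values indicate a shift in level, in slope, or in both.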

Day 2 | December 17, 2022

The second day of the course conducted by Professor Nilanjan Banik, titled “Exploratory Data Analysis with Categorical Variables Regression Models: Dummy Variables and Logit/Probit using EViews”, was devoted to the concepts of Logit and Probit. Professor Banik started by explaining the basic equation of a regression model and its components. The aim was to explain dummy variables, used when the explanatory variable X is qualitative, and the Logit/Probit model, used when the dependent variable Y is qualitative. Then, with an example of hourly wage rates, he showed how to interpret dummy variables for various categorical variables.
After explaining dummy variables, he moved on to Logit functions. In a Logit model the dependent variable, Y, takes a value of 1 or 0. The Logit or Probit model describes the probability (in the Logit case, the odds) of an individual meeting the outcome, given a certain trait or characteristic. He mentioned the importance of likelihood ratio (LR) tests in Probit models. The Logit/Probit models deal primarily with the dependent variable (Y). He showed that although the probability P lies between 0 and 1, the transformed dependent variable (in the Logit case, the log of the odds, P/(1 - P)) ranges from negative infinity to positive infinity, and he demonstrated this by deriving it from the probability.
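The range argument can be written out with the standard Logit formulation. The write-up does not reproduce the exact notation used in the session, so the symbols below (a single regressor X with coefficients \beta_0 and \beta_1) are assumptions chosen for illustration.

```latex
\begin{align*}
P &= \Pr(Y = 1 \mid X) = \frac{1}{1 + e^{-(\beta_0 + \beta_1 X)}},
  & 0 &< P < 1, \\
\frac{P}{1 - P} &= e^{\beta_0 + \beta_1 X},
  & 0 &< \frac{P}{1 - P} < \infty, \\
\ln\!\left(\frac{P}{1 - P}\right) &= \beta_0 + \beta_1 X,
  & -\infty &< \ln\!\left(\frac{P}{1 - P}\right) < \infty.
\end{align*}
```

As P approaches 0 the log of the odds tends to negative infinity, and as P approaches 1 it tends to positive infinity, which is why a linear function of X can sit on the right-hand side without restriction.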
After the theory, he began a practical lesson in EViews using a data set. First, he showed how to introduce dummy variables on a set of observations. Then he showed how to interpret Logit estimates in EViews. For this, he again used the earlier US data on savings and income to show the recession point. Using another data set, he showed how smoking is affected by age, income, and education. He explained what the value of P shows, using its formula, and demonstrated it in practice with the regression model and the results it generated. He then took questions from the trainees and answered them promptly. With this, the two-day training course ended.
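For readers without EViews, a Logit fit of a smoking indicator on age, income, and education can be sketched in Python. Everything here is an assumption made for illustration: the data are simulated, the regressors are standardized, the coefficients in `true_beta` (including their signs) are invented, and the Newton-Raphson routine `fit_logit` is a hypothetical helper, not the estimator or data set used in the session.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000

# Simulated, standardized regressors (hypothetical stand-ins for the real data).
age_z, income_z, educ_z = rng.standard_normal((3, n))
X = np.column_stack([np.ones(n), age_z, income_z, educ_z])

# Invented coefficients: intercept, age, income, education.
true_beta = np.array([-0.5, -0.5, 0.3, -0.8])
p_true = 1.0 / (1.0 + np.exp(-X @ true_beta))
y = (rng.random(n) < p_true).astype(float)      # 1 = smoker, 0 = non-smoker


def fit_logit(X, y, n_iter=25):
    """Maximum-likelihood Logit fit by Newton-Raphson iterations."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        grad = X.T @ (y - p)                          # score vector
        hess = X.T @ (X * (p * (1.0 - p))[:, None])   # information matrix
        beta = beta + np.linalg.solve(hess, grad)
    return beta


beta_hat = fit_logit(X, y)
p_hat = 1.0 / (1.0 + np.exp(-X @ beta_hat))     # fitted probabilities P
odds_ratios = np.exp(beta_hat[1:])              # effect of a one-unit change on the odds
```

Each entry of `odds_ratios` is the factor by which the odds of smoking change for a one-unit (here, one standard deviation) increase in the corresponding regressor, holding the others fixed; a value below 1 means lower odds.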
---
Acknowledgement: Aaswash Mahanta is a research intern at IMPRI
