Skip to main content

Importance of exploratory data analysis of dummy variables, logit/probit using eviews

By IMPRI Team 
 
IMPRI Generation Alpha Data Centre (GenAlphaDC) along with IMPRI Impact and Policy Research Institute, New Delhi conducted a Two-Week Immersive Online Hands-On Certificate Training Course on Exploratory Data Analysis with Categorical Variables Regression Models Dummy Variables and Logit/Probit using EViews, on December 10 and 17, 2022. The expert trainer for the course was Professor Nilanjan Banik, Professor at Mahindra University. He is a Visiting Consultant at IMPRI and an Academic Consultant with Geneva Network, United Kingdom, and a Senior Consultant with Hankuk University of Foreign Studies, South Korea.
The convenors for the event were Prof Vibhuti Patel, Visiting Professor at IMPRI and a Former Professor, Tata Institute of Social Sciences (TISS), Mumbai; Dr Soumyadip Chattopadhyay, Associate Professor, Economics, Visva-Bharati, Santiniketan and a Visiting Senior Fellow, IMPRI; Dr Arjun Kumar, Director, IMPRI. The training course had participants from the field of data and policy– including students, professionals, researchers, and many others.

Day 1 | December 10, 2022

The session began by going through the basics of Regression. The first question he pondered upon is what the meaning of a “dummy” is. He stated that in essence, it means a replica. Here, in a regression model, if X is a dummy variable, it means that it is a qualitative variable. He began by laying down some assumptions about the dependent and the independent variables. First, X and the Error Term (e) are not related, if related, there will be a problem of endogeneity. X is not quantitative if it is a dummy variable. He explained how we can constitute various qualitative traits in a dummy variable such as gender, and ethnicity among others in a regression model. It tries to capture the impact of any variable that is qualitative in nature.
Second, he mentioned that a dummy variable can capture any break or shift in data. He used the example of the Indian economic reforms of 1991, which was a breakpoint in terms of per capita GDP levels. After 1991 there was a big jump in GDP growth. In other words, there was a structural break. Dummy variables can capture such structural breaks. Thirdly, he mentioned that dummy variables can also be used to de-seasonalize the data. Using Excel, he showed how to incorporate dummy variables in a regression model and how dropping a dummy variable is important in order to avoid a Dummy Trap. He also showed how to de-seasonalize the data, using Excel. After de-seasonalizing the graph turned out to be more stable than before.
After explaining over Excel, he showcased the same data set on EViews. He selected the variables such as sales figures, trends, etc. He then created the dummy variables out of the four quarters. He ran the regression without the dummy first. Then he showcased a data set where he introduced a dummy and ran the regression. The data set used was US Trends in Gross Personal Income and Gross Personal Savings from 1959 to 2007. The dummy variables reflected the recession points from 1981 to 1984. The regression diagram consequently showed the breaking point due to the recession of 1981. The session ended for the day with this, after which, Professor Banik went on to take questions and clear doubts of the trainees. The next class was saved to learn Logit and Probit Models.

Day 2 | December 17, 2022

The second day of the session conducted by Professor Nilanjan Banik, titled, “Exploratory Data Analysis with Categorical Variables Regression Models: Dummy Variables and Logit/Probit using EViews” was devoted to the concepts of Logit and Probit. Professor Banik started by explaining the basic equation of a regression model, and the components within it. Here, the motive was to explain the concept of dummy variables, and the Probit/Logit model, when the variables X and Y are qualitative respectively. Then with an example of hourly wage rates, he showed how to interpret dummy variables for various categorical variables.
After explaining dummy variables, he followed it up by talking about Logit functions. He mentioned that in Logit functions, the dependent variable, Y, takes values of 1 or 0. The Logit or Probit model describes the odds of an individual meeting the outcome variable, given a certain trait or characteristic. He mentioned the importance of LR tests in Probit models. The Logit/Probit models primarily deal with the dependent variable (Y). He showed that the Y variable takes values between negative infinity to positive infinity. He proved this by showing the method to derive the value of Y using Probability. Since the P value will be between the value of 0 and 1, the Y value will take the value of negative Infinity to positive infinity.
After delving into theory, he started a practical lesson on the above discussions with the help of a data set on EViews. First, he showed how to introduce dummy variables on a set of observations. Then, Professor Banik went on to show how to interpret Logit functions on EViews. For this, he again used the previous US data on Savings and Income to show the recession point. Using other data he showed how smoking is affected by age, income, and education. He explained what the P value shows using the formula for the same. He also showed it practically based on the regression model and the results generated from it. Then he took questions from the trainees which he promptly clarified. With this, the two-day training course ended.
---
Acknowledgement: Aaswash Mahanta is a research intern at IMPRI

Comments

TRENDING

Job opportunities decreasing, wages remain low: Delhi construction workers' plight

By Bharat Dogra*   It was about 32 years back that a hut colony in posh Prashant Vihar area of Delhi was demolished. It was after a great struggle that the people evicted from here could get alternative plots that were not too far away from their earlier colony. Nirmana, an organization of construction workers, played an important role in helping the evicted people to get this alternative land. At that time it was a big relief to get this alternative land, even though the plots given to them were very small ones of 10X8 feet size. The people worked hard to construct new houses, often constructing two floors so that the family could be accommodated in the small plots. However a recent visit revealed that people are rather disheartened now by a number of adverse factors. They have not been given the proper allotment papers yet. There is still no sewer system here. They have to use public toilets constructed some distance away which can sometimes be quite messy. There is still no...

India's health workers have no legal right for their protection, regrets NGO network

Counterview Desk In a letter to Union labour and employment minister Santosh Gangwar, the civil rights group Occupational and Environmental Health Network of India (OEHNI), writing against the backdrop of strike by Bhabha hospital heath care workers, has insisted that they should be given “clear legal right for their protection”.

Uttarakhand tunnel disaster: 'Question mark' on rescue plan, appraisal, construction

By Bhim Singh Rawat*  As many as 40 workers were trapped inside Barkot-Silkyara tunnel in Uttarkashi after a portion of the 4.5 km long, supposedly completed portion of the tunnel, collapsed early morning on Sunday, Nov 12, 2023. The incident has once again raised several questions over negligence in planning, appraisal and construction, absence of emergency rescue plan, violations of labour laws and environmental norms resulting in this avoidable accident.

Rally in Patna: Non-farmer bodies to highlight plight of agriculture in Eastern India ahead of march to Parliament

P Sainath By  A  Representative Ahead of the march to Parliament on November 29-30, 2018, organized by over 210 farmer and agricultural worker organisations of the country demanding a 21-day special session of Parliament to deliberate on remedial measures for safeguarding the interest of farm, farmers and agricultural workers, a mass rally been organized for November 23, Gandhi Sangrahalaya (Gandhi Museum), Gandhi Maidan, Patna. Say the organizers, the Eastern region merits special attention, because, while crisis of farmers and agricultural workers in Western, Southern and Northern India has received some attention in the media and central legislature, the plight of those in the Eastern region of the country (Bihar, Jharkhand, West Bengal, Orissa, Chhattisgarh and Eastern UP) has remained on the margins. To be addressed by P Sainath, founder of People’s Archive of Rural India (PARI), a statement issued ahead of the rally says, the Eastern India was the most prosperous regi...

A comrade in culture and controversy: Yao Wenyuan’s revolutionary legacy

By Harsh Thakor*  This year marks two important anniversaries in Chinese revolutionary history—the 20th death anniversary of Yao Wenyuan, and the 50th anniversary of his seminal essay "On the Social Basis of the Lin Biao Anti-Party Clique". These milestones invite reflection on the man whose pen ignited the first sparks of the Great Proletarian Cultural Revolution and whose sharp ideological interventions left an indelible imprint on the political and cultural landscape of socialist China.

'MGNREGA crisis deepening': NSM demands fair wages and end to digital exclusions

By A Representative   The NREGA Sangharsh Morcha (NSM), a coalition of independent unions of MGNREGA workers, has warned that the Mahatma Gandhi National Rural Employment Guarantee Act (MGNREGA) is facing a “severe crisis” due to persistent neglect and restrictive measures imposed by the Union Government.

As 2024 draws nearer, threatening signs appear of more destructive wars

By Bharat Dogra  The four years from 2020 to 2023 have been very difficult and high risk years for humanity. In the first two years there was a pandemic and such severe disruption of social and economic life that countless people have not yet recovered from its many-sided adverse impacts. In the next two years there were outbreaks of two very high-risk wars which have worldwide implications including escalation into much wider conflicts. In addition there were highly threatening signs of increasing possibility of other very destructive wars. As the year 2023 appears to be headed for ending on a very grim note, there are apprehensions about what the next year 2024 may bring, and there are several kinds of fears. However to come back to the year 2020 first, the pandemic harmed and threatened a very large number of people. No less harmful was the fear epidemic, the epidemic of increasing mental stress and the cruel disruption of the life and livelihoods particularly among the weaker s...

Arun Kamal’s poetry as conscience: Beauty, ugliness, and the sociology of resistance

By Ravi Ranjan*  Poetry in India has never been only about beauty. It has been conscience, witness, and resistance, an art form that breathes life into the anxieties of society while also holding up a mirror to its contradictions. From the ecstatic devotional voices of Kabir and Mirabai to the realism of modern poets who turned their gaze on exploitation and injustice, verse has spoken both for the self and for the collective. In this long lineage, Arun Kamal stands out as a poet who does not merely compose verses but also reflects deeply on the very function of poetry. His poetry and criticism together reveal him as a figure who, in Rajasekhara’s words, is both gold and touchstone—creator and critic in one.

Green dreams, harsh realities: Why India’s eco-friendly projects face an uncertain future

By N.S. Venkataraman*  Around the world, policy makers and scientists agree that the long-term solution to environmental degradation and the climate crisis lies in scaling up renewable energy and launching eco-friendly projects such as green hydrogen, green ammonia, and green methanol. These initiatives are seen as vital in reducing harmful emissions of carbon dioxide, sulphur dioxide, and nitrous oxide by moving away from fossil fuels. On paper, the idea is flawless. In practice, however, the future of these projects is clouded with uncertainties.