Skip to main content

Importance of exploratory data analysis of dummy variables, logit/probit using eviews

By IMPRI Team 
 
IMPRI Generation Alpha Data Centre (GenAlphaDC) along with IMPRI Impact and Policy Research Institute, New Delhi conducted a Two-Week Immersive Online Hands-On Certificate Training Course on Exploratory Data Analysis with Categorical Variables Regression Models Dummy Variables and Logit/Probit using EViews, on December 10 and 17, 2022. The expert trainer for the course was Professor Nilanjan Banik, Professor at Mahindra University. He is a Visiting Consultant at IMPRI and an Academic Consultant with Geneva Network, United Kingdom, and a Senior Consultant with Hankuk University of Foreign Studies, South Korea.
The convenors for the event were Prof Vibhuti Patel, Visiting Professor at IMPRI and a Former Professor, Tata Institute of Social Sciences (TISS), Mumbai; Dr Soumyadip Chattopadhyay, Associate Professor, Economics, Visva-Bharati, Santiniketan and a Visiting Senior Fellow, IMPRI; Dr Arjun Kumar, Director, IMPRI. The training course had participants from the field of data and policy– including students, professionals, researchers, and many others.

Day 1 | December 10, 2022

The session began by going through the basics of Regression. The first question he pondered upon is what the meaning of a “dummy” is. He stated that in essence, it means a replica. Here, in a regression model, if X is a dummy variable, it means that it is a qualitative variable. He began by laying down some assumptions about the dependent and the independent variables. First, X and the Error Term (e) are not related, if related, there will be a problem of endogeneity. X is not quantitative if it is a dummy variable. He explained how we can constitute various qualitative traits in a dummy variable such as gender, and ethnicity among others in a regression model. It tries to capture the impact of any variable that is qualitative in nature.
Second, he mentioned that a dummy variable can capture any break or shift in data. He used the example of the Indian economic reforms of 1991, which was a breakpoint in terms of per capita GDP levels. After 1991 there was a big jump in GDP growth. In other words, there was a structural break. Dummy variables can capture such structural breaks. Thirdly, he mentioned that dummy variables can also be used to de-seasonalize the data. Using Excel, he showed how to incorporate dummy variables in a regression model and how dropping a dummy variable is important in order to avoid a Dummy Trap. He also showed how to de-seasonalize the data, using Excel. After de-seasonalizing the graph turned out to be more stable than before.
After explaining over Excel, he showcased the same data set on EViews. He selected the variables such as sales figures, trends, etc. He then created the dummy variables out of the four quarters. He ran the regression without the dummy first. Then he showcased a data set where he introduced a dummy and ran the regression. The data set used was US Trends in Gross Personal Income and Gross Personal Savings from 1959 to 2007. The dummy variables reflected the recession points from 1981 to 1984. The regression diagram consequently showed the breaking point due to the recession of 1981. The session ended for the day with this, after which, Professor Banik went on to take questions and clear doubts of the trainees. The next class was saved to learn Logit and Probit Models.

Day 2 | December 17, 2022

The second day of the session conducted by Professor Nilanjan Banik, titled, “Exploratory Data Analysis with Categorical Variables Regression Models: Dummy Variables and Logit/Probit using EViews” was devoted to the concepts of Logit and Probit. Professor Banik started by explaining the basic equation of a regression model, and the components within it. Here, the motive was to explain the concept of dummy variables, and the Probit/Logit model, when the variables X and Y are qualitative respectively. Then with an example of hourly wage rates, he showed how to interpret dummy variables for various categorical variables.
After explaining dummy variables, he followed it up by talking about Logit functions. He mentioned that in Logit functions, the dependent variable, Y, takes values of 1 or 0. The Logit or Probit model describes the odds of an individual meeting the outcome variable, given a certain trait or characteristic. He mentioned the importance of LR tests in Probit models. The Logit/Probit models primarily deal with the dependent variable (Y). He showed that the Y variable takes values between negative infinity to positive infinity. He proved this by showing the method to derive the value of Y using Probability. Since the P value will be between the value of 0 and 1, the Y value will take the value of negative Infinity to positive infinity.
After delving into theory, he started a practical lesson on the above discussions with the help of a data set on EViews. First, he showed how to introduce dummy variables on a set of observations. Then, Professor Banik went on to show how to interpret Logit functions on EViews. For this, he again used the previous US data on Savings and Income to show the recession point. Using other data he showed how smoking is affected by age, income, and education. He explained what the P value shows using the formula for the same. He also showed it practically based on the regression model and the results generated from it. Then he took questions from the trainees which he promptly clarified. With this, the two-day training course ended.
---
Acknowledgement: Aaswash Mahanta is a research intern at IMPRI

Comments

TRENDING

Plastic burning in homes threatens food, water and air across Global South: Study

By Jag Jivan  In a groundbreaking  study  spanning 26 countries across the Global South , researchers have uncovered the widespread and concerning practice of households burning plastic waste as a fuel for cooking, heating, and other domestic needs. The research, published in Nature Communications , reveals that this hazardous method of managing both waste and energy poverty is driven by systemic failures in municipal services and the unaffordability of clean alternatives, posing severe risks to human health and the environment.

From protest to proof: Why civil society must rethink environmental resistance

By Shankar Sharma*  As concerned environmentalists and informed citizens, many of us share deep unease about the way environmental governance in our country is being managed—or mismanaged. Our complaints range across sectors and regions, and most of them are legitimate. Yet a hard question confronts us: are complaints, by themselves, effective? Experience suggests they are not.

Economic superpower’s social failure? Inequality, malnutrition and crisis of India's democracy

By Vikas Meshram  India may be celebrated as one of the world’s fastest-growing economies, but a closer look at who benefits from that growth tells a starkly different story. The recently released World Inequality Report 2026 lays bare a country sharply divided by wealth, privilege and power. According to the report, nearly 65 percent of India’s total wealth is owned by the richest 10 percent of its population, while the bottom half of the country controls barely 6.4 percent. The top one percent—around 14 million people—holds more than 40 percent, the highest concentration since 1961. Meanwhile, the female labour force participation rate is a dismal 15.7 percent.

Kolkata event marks 100 years since first Communist conference in India

By Harsh Thakor*   A public assembly was held in Kolkata on December 24, 2025, to mark the centenary of the First Communist Conference in India , originally convened in Kanpur from December 26 to 28, 1925. The programme was organised by CPI (ML) New Democracy at Subodh Mallik Square on Lenin Sarani. According to the organisers, around 2,000 people attended the assembly.

From colonial mercantilism to Hindutva: New book on the making of power in Gujarat

By Rajiv Shah  Professor Ghanshyam Shah ’s latest book, “ Caste-Class Hegemony and State Power: A Study of Gujarat Politics ”, published by Routledge , is penned by one of Gujarat ’s most respected chroniclers, drawing on decades of fieldwork in the state. It seeks to dissect how caste and class factors overlap to perpetuate the hegemony of upper strata in an ostensibly democratic polity. The book probes the dominance of two main political parties in Gujarat—the Indian National Congress and the BJP—arguing that both have sustained capitalist growth while reinforcing Brahmanic hierarchies.

Urgent need to study cause of large number of natural deaths in Gulf countries

By Venkatesh Nayak* According to data tabled in Parliament in April 2018, there are 87.76 lakh (8.77 million) Indians in six Gulf countries, namely Bahrain, Kuwait, Oman, Qatar, Saudi Arabia and the United Arab Emirates (UAE). While replying to an Unstarred Question (#6091) raised in the Lok Sabha, the Union Minister of State for External Affairs said, during the first half of this financial year alone (between April-September 2018), blue-collared Indian workers in these countries had remitted USD 33.47 Billion back home. Not much is known about the human cost of such earnings which swell up the country’s forex reserves quietly. My recent RTI intervention and research of proceedings in Parliament has revealed that between 2012 and mid-2018 more than 24,570 Indian Workers died in these Gulf countries. This works out to an average of more than 10 deaths per day. For every US$ 1 Billion they remitted to India during the same period there were at least 117 deaths of Indian Workers in Gulf ...

The greatest threat to our food system: The aggressive push for GM crops

By Bharat Dogra  Thanks to the courageous resistance of several leading scientists who continue to speak the truth despite increasing pressures from the powerful GM crop and GM food lobby , the many-sided and in some contexts irreversible environmental and health impacts of GM foods and crops, as well as the highly disruptive effects of this technology on farmers, are widely known today. 

History, culture and literature of Fatehpur, UP, from where Maulana Hasrat Mohani hailed

By Vidya Bhushan Rawat*  Maulana Hasrat Mohani was a member of the Constituent Assembly and an extremely important leader of our freedom movement. Born in Unnao district of Uttar Pradesh, Hasrat Mohani's relationship with nearby district of Fatehpur is interesting and not explored much by biographers and historians. Dr Mohammad Ismail Azad Fatehpuri has written a book on Maulana Hasrat Mohani and Fatehpur. The book is in Urdu.  He has just come out with another important book, 'Hindi kee Pratham Rachna: Chandayan' authored by Mulla Daud Dalmai.' During my recent visit to Fatehpur town, I had an opportunity to meet Dr Mohammad Ismail Azad Fatehpuri and recorded a conversation with him on issues of history, culture and literature of Fatehpur. Sharing this conversation here with you. Kindly click this link. --- *Human rights defender. Facebook https://www.facebook.com/vbrawat , X @freetohumanity, Skype @vbrawat

Transgender Bill testimony of Govt of India's ‘contempt’ for marginalized community

Counterview Desk India’s civil society network, National Alliance of People’s Movements (NAPM)* has said that the controversial transgender Bill, passed in the Rajya Sabha on November 26, which happened to be the 70th anniversary of the Indian Constitution, is a reflection on the way the Government of India looks at the marginalized community with utter contempt.