Importance of exploratory data analysis with dummy variables and logit/probit using EViews

By IMPRI Team 
 
IMPRI Generation Alpha Data Centre (GenAlphaDC), along with IMPRI Impact and Policy Research Institute, New Delhi, conducted a Two-Week Immersive Online Hands-On Certificate Training Course on Exploratory Data Analysis with Categorical Variables Regression Models: Dummy Variables and Logit/Probit using EViews, on December 10 and 17, 2022. The expert trainer for the course was Nilanjan Banik, Professor at Mahindra University. He is also a Visiting Consultant at IMPRI, an Academic Consultant with Geneva Network, United Kingdom, and a Senior Consultant with Hankuk University of Foreign Studies, South Korea.
The convenors for the event were Prof Vibhuti Patel, Visiting Professor at IMPRI and a Former Professor, Tata Institute of Social Sciences (TISS), Mumbai; Dr Soumyadip Chattopadhyay, Associate Professor, Economics, Visva-Bharati, Santiniketan and a Visiting Senior Fellow, IMPRI; and Dr Arjun Kumar, Director, IMPRI. The training course had participants from the field of data and policy, including students, professionals, researchers, and many others.

Day 1 | December 10, 2022

The session began with the basics of regression. Professor Banik first asked what a “dummy” means, and explained that in essence it is a replica. In a regression model, if X is a dummy variable, it is a qualitative rather than a quantitative variable. He then laid down some assumptions about the dependent and the independent variables. One of these is that X and the error term (e) are unrelated; if they are related, the model has an endogeneity problem. He explained how qualitative traits such as gender and ethnicity, among others, can be represented by dummy variables in a regression model. The first use of a dummy variable, then, is to capture the impact of any variable that is qualitative in nature.
Second, he mentioned that a dummy variable can capture any break or shift in data. He used the example of the Indian economic reforms of 1991, which marked a breakpoint in per capita GDP levels: after 1991 there was a big jump in GDP growth. In other words, there was a structural break, and dummy variables can capture such breaks. Third, he mentioned that dummy variables can also be used to de-seasonalize data. Using Excel, he showed how to incorporate dummy variables in a regression model and why one dummy must be dropped to avoid the dummy variable trap, that is, perfect collinearity between the intercept and a full set of dummies. He also showed how to de-seasonalize the data in Excel. After de-seasonalizing, the graph turned out to be more stable than before.
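The seasonal-dummy steps described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Excel workbook used in the session: the quarterly sales series is made up, and the helper names `ols` and `design` are hypothetical. The sketch drops the first-quarter dummy as the base category, removes the estimated seasonal effects, and then shows that keeping all four dummies alongside the intercept makes the regression unsolvable.

```python
def ols(X, y):
    """Least-squares coefficients via the normal equations (X'X) b = X'y."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in X) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(X, y))] for i in range(k)]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(k):
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        if abs(A[p][c]) < 1e-9:
            raise ValueError("singular X'X: perfectly collinear regressors")
        A[c], A[p] = A[p], A[c]
        for r in range(k):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    return [A[i][k] / A[i][i] for i in range(k)]

# MADE-UP quarterly sales: level 100, trend +2 per quarter, fixed seasonal effects.
SEASONAL = {1: 0.0, 2: 10.0, 3: -5.0, 4: 20.0}
quarters = [(t % 4) + 1 for t in range(12)]
sales = [100.0 + 2.0 * t + SEASONAL[q] for t, q in enumerate(quarters)]

def design(drop_q1=True):
    """Rows of [intercept, trend, quarter dummies]; Q1 is the base if dropped."""
    qs = (2, 3, 4) if drop_q1 else (1, 2, 3, 4)
    return [[1.0, float(t)] + [1.0 if q == k else 0.0 for k in qs]
            for t, q in enumerate(quarters)]

b = ols(design(), sales)  # [intercept, trend, Q2, Q3, Q4]
effect = {1: 0.0, 2: b[2], 3: b[3], 4: b[4]}
deseasonalized = [s - effect[q] for s, q in zip(sales, quarters)]
print("coefficients:", [round(v, 3) for v in b])
print("de-seasonalized:", [round(v, 1) for v in deseasonalized])

# Dummy variable trap: intercept plus all four quarter dummies is collinear.
try:
    ols(design(drop_q1=False), sales)
    trap_detected = False
except ValueError:
    trap_detected = True
print("dummy variable trap detected:", trap_detected)
```

Because the made-up series is exactly linear in the regressors, the fit recovers the assumed effects up to rounding, and the de-seasonalized series is the smooth trend line. Each dummy coefficient is read relative to the dropped quarter.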
After the Excel walkthrough, he worked through the same data set in EViews. He selected variables such as sales figures and trend, and created dummy variables for the four quarters. He first ran the regression without the dummies. He then presented a data set in which he introduced a dummy and ran the regression again. The data set used was US trends in gross personal income and gross personal savings from 1959 to 2007. The dummy variables marked the recession period from 1981 to 1984. The resulting regression plot showed the break caused by the recession of 1981. With this the session ended for the day, after which Professor Banik took questions and cleared the trainees' doubts. The next class was reserved for Logit and Probit models.
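The idea of a dummy that marks a break can be sketched as follows. The numbers are made up for illustration and are not the US income and savings data used in the session; `simple_ols` is a hypothetical helper. With an intercept and a single 0/1 break dummy, the intercept is the pre-break mean and the dummy coefficient is the size of the shift.

```python
def simple_ols(x, y):
    """Intercept and slope of y = a + b*x by ordinary least squares."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    b = sxy / sxx
    return my - b * mx, b

# MADE-UP series with a level shift from the seventh observation onward.
series = [3.0, 3.2, 2.8, 3.1, 2.9, 3.0, 6.1, 5.9, 6.0, 6.2, 5.8, 6.0]
break_dummy = [0] * 6 + [1] * 6  # 0 before the break, 1 from the break on

intercept, shift = simple_ols(break_dummy, series)
pre_mean = sum(series[:6]) / 6
post_mean = sum(series[6:]) / 6

print("intercept (pre-break mean):", round(intercept, 3))
print("dummy coefficient (shift): ", round(shift, 3))
print("post-break mean:           ", round(post_mean, 3))
```

A coefficient on the dummy that is clearly different from zero is what signals the break. A fuller model would also include the other regressors, for example income, next to the dummy.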

Day 2 | December 17, 2022

The second day of the course, conducted by Professor Nilanjan Banik and titled “Exploratory Data Analysis with Categorical Variables Regression Models: Dummy Variables and Logit/Probit using EViews”, was devoted to the concepts of Logit and Probit. Professor Banik started by explaining the basic equation of a regression model and its components. The aim was to explain dummy variables, which apply when the explanatory variable X is qualitative, and the Logit/Probit model, which applies when the dependent variable Y is qualitative. Then, with an example of hourly wage rates, he showed how to interpret dummy variables for various categorical variables.
After explaining dummy variables, he turned to Logit functions. He mentioned that in a Logit model the dependent variable, Y, takes the value 1 or 0. The Logit or Probit model describes the probability, or the odds, of an individual meeting the outcome, given a certain trait or characteristic. He mentioned the importance of likelihood ratio (LR) tests in Probit models. The Logit/Probit models are primarily concerned with the dependent variable (Y). He showed that the transformed Y variable (in the Logit case, the log of the odds, ln(P/(1 - P))) takes values from negative infinity to positive infinity. He demonstrated this by deriving Y from the probability P: since P lies between 0 and 1, the transformed Y ranges from negative infinity to positive infinity.
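The mapping between a probability and an unbounded index can be checked numerically. The sketch below assumes the standard definitions: the Logit index is the log-odds ln(P/(1 - P)), and the Probit index is the inverse of the standard normal CDF. The function names are chosen for illustration only.

```python
import math
from statistics import NormalDist

def logit(p):
    """Log-odds ln(p / (1 - p)) for a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))

def inverse_logit(z):
    """Logistic function: maps any real index z back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))

def probit_index(p):
    """Probit counterpart: inverse of the standard normal CDF."""
    return NormalDist().inv_cdf(p)

# As P approaches 0 or 1, both indices grow without bound in either direction.
for p in (0.001, 0.25, 0.5, 0.75, 0.999):
    z = logit(p)
    print(f"P = {p:5.3f}  logit = {z:7.3f}  probit = {probit_index(p):7.3f}  "
          f"back to P = {inverse_logit(z):5.3f}")
```

As P moves toward 0 the index falls without bound, and as P moves toward 1 it rises without bound. This is why a linear function of X can be fitted to the index even though P itself stays between 0 and 1.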
After the theory, he began a practical lesson on the above discussion using a data set in EViews. First, he showed how to introduce dummy variables for a set of observations. Then Professor Banik showed how to interpret Logit functions in EViews. For this, he again used the earlier US data on savings and income to show the recession point. Using another data set, he showed how smoking is affected by age, income, and education. He explained what the P value shows, using its formula, and also demonstrated it in practice with the regression model and the results it generated. He then took questions from the trainees and answered them promptly. With this, the two-day training course ended.
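For readers without EViews, a Logit fit of the kind described for the smoking example can be sketched in plain Python. Everything here is an assumption for illustration: the data are synthetic, the "true" coefficients are made up, age, income and education are treated as standardised scores, and plain gradient ascent stands in for whatever estimation routine EViews uses.

```python
import math
import random

def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def avg_loglik(beta, X, y):
    """Average Bernoulli log-likelihood of a Logit model."""
    total = 0.0
    for row, yi in zip(X, y):
        p = sigmoid(sum(b * x for b, x in zip(beta, row)))
        total += math.log(p) if yi == 1 else math.log(1.0 - p)
    return total / len(y)

def fit_logit(X, y, step=0.5, iters=500):
    """Maximise the average log-likelihood by plain gradient ascent."""
    beta = [0.0] * len(X[0])
    for _ in range(iters):
        grad = [0.0] * len(beta)
        for row, yi in zip(X, y):
            err = yi - sigmoid(sum(b * x for b, x in zip(beta, row)))
            for j, x in enumerate(row):
                grad[j] += err * x
        beta = [b + step * g / len(y) for b, g in zip(beta, grad)]
    return beta

# SYNTHETIC data: columns are intercept, age, income, education (standardised).
rng = random.Random(0)
TRUE_BETA = [-0.5, -0.3, 0.2, -0.8]  # made-up effects, for illustration only
X, y = [], []
for _ in range(500):
    row = [1.0, rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)]
    p = sigmoid(sum(b * x for b, x in zip(TRUE_BETA, row)))
    X.append(row)
    y.append(1 if rng.random() < p else 0)  # 1 = smoker, 0 = non-smoker

beta_hat = fit_logit(X, y)
start_ll = avg_loglik([0.0] * 4, X, y)
final_ll = avg_loglik(beta_hat, X, y)
print("estimated coefficients:", [round(b, 3) for b in beta_hat])
print("average log-likelihood:", round(start_ll, 4), "->", round(final_ll, 4))
```

Each estimated coefficient is the change in the log-odds of smoking for a one-unit change in that variable, holding the others fixed. Passing the fitted index through `sigmoid` gives the predicted probability for an individual.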
---
Acknowledgement: Aaswash Mahanta is a research intern at IMPRI
