variable | code | description | units |
---|---|---|---|
country | country name | ||
time | time period | ||
cons_prices | CP | consumer prices | index |
employment | EMP | employment | persons |
interest_interbank | IRSTCI | interbank interest rate | percent per annum |
interest_long | IRLT | long-term interest rate | percent per annum |
interest_short | IR3TIB | short-term interest rate | percent per annum |
money_broad | MABM | broad money (M3) | index |
money_narrow | MANM | narrow money (M1) | index |
nom_exch_rate | CC | nominal exchange rate | national currency per US dollar |
share_grow | SHARE | share prices | growth rate |
share_index | SHARE | share prices | index |
oecd | OECD country | dummy variable |
OECD Economies
Australia, Austria, Belgium, Canada, Chile, Colombia, Costa Rica, Czechia, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Israel, Italy, Japan, Korea, Latvia, Lithuania, Luxembourg, Mexico, Netherlands, New Zealand, Norway, Poland, Portugal, Slovak Republic, Slovenia, Spain, Sweden, Switzerland, Turkiye, United Kingdom, United States
Non-OECD Economies
Argentina, Brazil, China (People's Republic of), Croatia, India, Indonesia, Russia, Saudi Arabia, South Africa
Monthly Data
Jan. 1949 to Nov. 2024
notes
The dataset also contains dummy variables for each country. And for each variable, the Gretl data file also contains differences and log differences (as appropriate).
download
This dataset contains financial market data from data-explorer.oecd.org covering 38 OECD countries and 9 non-OECD countries. You may use this data to explore the effect that interest rates, exchange rates, inflation rates and the money supply have on share prices.
Specifically, you may test the null hypotheses that:
Then focus on the variables where we rejected the null hypothesis. In those cases, we have accepted the alternative hypothesis that there is a relationship, so now we want to know:
For example, if we think that interest rates will rise one percentage point next month, then how much will share prices fall in response to that change?
As you conduct your analysis, you must remember that one of the Gauss-Markov assumptions is that your residuals ("error terms") must not be correlated with each other. Related to this assumption is the concept of "stationary residuals" -- the mean and variance of your residuals must be constant over time.
Taking the difference in value between one time period and the next will usually make a series stationary, so if you difference each variable in your regression model your residuals will usually be stationary. Differencing, therefore, usually ensures that your residuals are stationary.
The alternative is to find a co-integrating relationship among your variables that makes the residuals stationary. In practice however, it is difficult to find such co-integrating relationships, so I encourage you to work with the differenced variables.
From the perspective of an investor, the share price itself is not important. What's important is the change in share price (i.e. the difference in share price).
So from an investment perspective, you want to develop a model that predicts changes in share price. What predicts those changes?
And how large is the effect of those changes on the change in share price?
As another possible project, you could forecast changes in share prices and changes in interest rates. In this case, you would not use the full panel of countries. Instead, you would perform a time-series analysis for a single country. (Or, if you want to forecast financial variables for more than one country, you would perform a time-series analysis for each country).
To isolate a single country, go to Gretl's "Sample" menu, select "Restrict based on criterion ..." and then, in the pop-up box, enter the boolean condition: united_states == 1 For your own convenience, you should make the restriction permanent and save the restricted data to a new file. Then you should go to Gretl's "Data" menu, select "Dataset structure ..." and then, in the pop-up box, select "Time series", select "Monthly" frequency and start on: 1949:01 (January 1949). Then, finally, save the dataset one more time.
When forecasting, you need to evaluate your model's predictions out-of-sample. In other words, because you're predicting future changes in financial variables, you need to simulate predictions of the future. Then you evaluate your model's forecasting performance on the simulated future.
For example, the simulated future might be the last two years of data. In that case, you would estimate the model's parameters on the data from January 1949 to December 2022. Then you evaluate its forecasts out-of-sample – from January 1949 to December 2024. To do so, go to Gretl's "Sample" menu, select "Set range ..." and set the sample range to end in 2022:12 (December 2022).
Then, after estimating your model, go to the "Analysis" menu (in the model's window), select "Forecasts" and select the variable that you wish to forecast out-of-sample. When you do, you'll have two options: "dynamic forecast" and "static forecast." Dynamic forecasts use the chained forecasts of lagged variables in the out-of-sample period. Static forecasts simply use the fitted values, even though they're computed out-of-sample.
Finally, to evaluate your model's forecasts, Gretl provides several metrics at the bottom of the forecasts window. Assuming that forecast error causes a securities trader to lose money, then you could evaluate your forecasts with the "Mean Absolute Error" to compute average total loss or you could evaluate your forecasts with the "Root Mean Squared Error," which gives more weight to large losses.
variable | code | description | units |
---|---|---|---|
country | country name | ||
year | year of observation | ||
gender_wage_gap_median | GWP | gender wage gap – diff. between median wages rel. to men's median wage | percentage |
emprate_fe | EMP_WAP | employment rate of females, age 25-54 | percentage |
emprate_ma | EMP_WAP | employment rate of males, age 25-54 | percentage |
wkapop_fe | WAP | working age females, age 25-54 | persons in thousands |
wkapop_ma | WAP | working age males, age 25-54 | persons in thousands |
eprc_v1 | EPL_OV | employment protection – dismissals | from 0 to 6 |
ept_v1 | EPL_T | employment protection – temporary | from 0 to 6 |
cpi_inflation | CPI | growth rate of consumer prices, all items non-food non-energy | percent per annum |
real_min_wage | SM_WG | statutory real minimum wages at constant prices | US dollars, PPP converted |
real_gdp_per_capita | B1GQ_R_POP | real gross domestic product per capita | US dollars per person, PPP converted |
union_density | TUD | trade union density | percentage of employees |
OECD Economies
Australia, Austria, Belgium, Canada, Chile, Colombia, Costa Rica, Czechia, Denmark, Estonia, Finland, France, Germany, Greece, Hungary, Iceland, Ireland, Israel, Italy, Japan, Korea, Latvia, Lithuania, Luxembourg, Mexico, Netherlands, New Zealand, Norway, Poland, Portugal, Slovak Republic, Slovenia, Spain, Sweden, Switzerland, Turkiye, United Kingdom, United States
Annual Data
1990 to 2019
notes
The dataset also contains dummy variables for each country and year. And for the real minimum wage and real GDP per capita variables, the Gretl data file also contains the natural log of the variables.
download
This dataset contains labor market data from data-explorer.oecd.org covering 38 OECD countries. It's similar to the dataset that we use in class to discuss employment protection. There are two differences. The first is that it covers a different set of years. The second is that this dataset also contains the "gender wage gap" – the percentage difference between median male and female wages relative to the men's median wage.
In class, we discuss the effect of labor market regulation on male and female employment rates. You might extend that analysis by exploring the effect of labor market regulation on the gender wage gap.
If you do, you would account for the simultaneity issues that arise in the supply and demand for male/female labor. Then you would find an instrumental variable to estimate the effect of labor market regulation on the gender wage gap.
In other words, you want to properly specify a model that you can use to test the null hypotheses that:
Then you would focus on the variables where we rejected the null hypothesis. In those cases, we have accepted the alternative hypothesis that there is a relationship, so now we want to know:
variable | code | description | units |
---|---|---|---|
country | country name | ||
year | year of observation | ||
edu_lesssecond | below upper secondary education | percentage of population | |
edu_secondary | upper secondary or post-secondary non-tertiary education | percentage of population | |
edu_tertiary | tertiary education | percentage of population | |
emprate | EMP_WAP | employment rate, age 25-54, both sexes | percentage |
gini | INC_DISP_GINI | Gini coefficient (based on disposable income) – measure of income inequality | from 0 to 100 |
inflows_pct | INMIG | new residents in the region coming from another country | percentage of population |
inflows_total | INMIG | new residents in the region coming from another country | total persons |
netmigration_total | NETMIG | net international migration | total inflows minus total outflows |
outflows_pct | OUTMIG | persons who left the region to reside in another country | percentage of population |
outflows_total | OUTMIG | persons who left the region to reside in another country | total persons |
pop | POP | population | total persons |
popgrow | POP | population growth rate | percentage change from previous year |
povrate | PR_INC_DISP | poverty rate – less than 50% of national median disposable income | percentage |
rgdpcap | B1GQ_R_POP | real GDP per capita | USD per person, PPP converted |
OECD Economies
Austria, Belgium, Czechia, Denmark, Estonia, Finland, Germany, Italy, Japan, Latvia, Lithuania, Netherlands, New Zealand, Norway, Poland, Portugal, Slovak Republic, Slovenia, Spain, Switzerland, Turkiye
Annual Data
1990 to 2023
notes
The dataset also contains dummy variables for each country and year. The Gretl data file also contains net migration as a percentage of the population. And the Gretl data file also contains the natural log of the real GDP per capita variable.
download
This dataset contains migration data from data-explorer.oecd.org covering 21 OECD countries. It also contains data on educational attainment, employment rates, income inequality (as measured by Gini coefficient), poverty rates, population growth rates and real GDP per capita.
If you use this dataset, you can perform an analysis of the economic factors that affect net migration. Specifically, you can test the null hypotheses that:
Then you would focus on the variables where we rejected the null hypothesis. In those cases, we have accepted the alternative hypothesis that there is a relationship, so now we want to know:
Copyright © 2002-2025 Eryk Wdowiak