All posts by KieonHolder
ESSAY 8
ECONOMETRICS
Conditional volatility model
Introduction
The study involved the prices of shares in the stock exchange market from 2007-01-03 till 2018-
4-30. The researcher selected a sample size of 2582 for the study. The prices’ opening, closing,
high and low values were recorded. Also, the researcher collected volumes of sales of the shares.
The data entry and cleaning were done in excel and then exported to R statistical software for
analysis. The data was converted into time-series data in R, ready for analysis. The study aimed
at obtaining the returns and fitting the generalized autoregressive conditional heteroskedasticity
(GARCH) and autoregressive conditional heteroskedasticity (ARCH). Those are the conditional
vitality model.
Results and findings
Prices
The share prices were plotted using the charSeries function in R. The data series shows that the
prices were high at the start of 2008 (around 35) but dropped significantly to the least price in the
series of around 16 at the end of the year. The highest prices were witnessed around the mid of
2016, with a price value of above 40. The plot of the prices of the shares demonstrated an
increasing trend, meaning that it was not stationary. The variance was not constant over the
series; thus, there was the presence of heteroskedasticity. Figure 1 below shows how the prices
of shares were distributed.
6
Figure 1: Time series plot of prices of shares from 2007 to 2018
Returns
The returns are the changes in the prices of investment, asset, or a project over time. It can be
represented by the real values of the price change or percentage. A positive return indicates a
profit was made, while a negative return indicates a loss. The returns of the share prices were
estimated using dailyReturn in R as shown in the appendix. The returns indicated a maximum
profit of 14% and a maximum loss of 13% over the period. The average and standard deviations
of the returns were 0. The researcher plotted returns for easy interpretation and observed that
normality existed. Figure 2 below shows that the mean of the returns was constant all through at
zero. Mid 2013 was associated with the maximum loss while mid-2014 was associated with the
maximum profit, as evidenced in the graph below;
6
Figure 2: Plot of returns of the prices of shares
Model
It is the most used measure of the risk associated with assets and investments. Volatility models
are very important when dealing with economic and financial models because of the high
fluctuations. In my case, the data was entered daily, showing how fluctuating it was. As a result,
GARCH was fitted to aid in understanding the fluctuations and making predictions on the
possibility of prices increasing or decreasing. The GARCH model of conditional variance was
fitted as GARCH (1, 1) and ARFIMA (1, 0, 1), the mean model, for the returns. The distribution
of the returns was normal; thus, the shape, skew, and lambda was not produced. As per the
Pearson adjusted goodness of fit statistics, the p-values are less than the alpha at 0.05 test of
significance; thus, we conclude that the model was statistically significant. Additionally, the
likelihood estimate was obtained to be 7425. 046. The high value of the likelihood estimate
6
indicates the model’s goodness of fit. The optimal parameters of the model were as shown by
table 1 below;
Table 1: Optimal parameters of the GARCH model
The omega, alpha, and beta were statistically significant because the absolute t statistic values
were more than 2. Also, their p-values were less than the test of significance of 0.05. The alpha
value was computed as 0.0741 and the beta value as 0.8601. The alpha value is around 0.05;
thus, the market was stable. The beta was between 0.85 and 0.98; thus, the persistence was great.
The alpha value was low, and the beta value was high; thus, the GARCH volatilities had low vol-
of-vol. The stability statistic is 1.9968 indicating a stable market existed.
Heteroskedasticity rarely affects the parameters of Ordinary Least Square (OLS) estimates.
However, it has a biasness impact on the variance matrix. To reduce that, the robust standard
errors estimates were also computed as shown by table 2 below;
Table 2: Robust standard error estimates
6
The estimates were not much different from those of the optimal parameters. Moving average
(MA) effects were statistically significant with robust standard error estimates. Omega, alpha,
and beta were also statistically significant.
The information criteria were also computed. The AIC was -5.21, and BIC was -5.19. The less
the information criteria, the better the model. The estimates are shown in table 3 below;
Table 3: The information criteria of the AGARCH model
The autocorrelation between the residuals was tested using the Ljung-Box. The hypothesis was
that;
H0: No serial correlation
H1: Serial correlation exists
The researcher obtained the p-value to be 0.873. That value is greater than the alpha at 0.05
significance test; thus, we retain the null hypothesis and conclude no serial correlation in the
residuals. The descriptive statistics of the residuals of the model were calculated. It had a range
of 0.004, a mean of 3.701E-04, and a median of 7.39E-05. Table 4 summarizes that.
6
Table 4: Descriptive statistics of squared-residuals
Figure 3 below shows the squared residuals plotted with the conditional variance. The variance
was 2.054213e-06, and the mean was 3.701e-04. The variance is constant all through the series
of residuals.
Figure 3: Plot of squared residuals and conditional variance.
6
Figure 4: The Q-Q plot of the squared residuals
Forecast
The predictions were made for a future period of 10. From the sixth value, the trend of the series
flattens. The values obtained for the series and sigma are shown respectively by the table 5
below;
Table 5: The projected values of the series and the sigma
Time Series Sigma
T1 0.00054 0.01538
T2
0.00062
3 0.0159
T3
0.00059
7 0.0159
T4
0.00060
1 0.01613
T5
0.00060
2 0.01623
6
T6
0.00060
3 0.017
T7
0.00060
3 0.01673
T8
0.00060
3 0.01689
T9
0.00060
3 0.01705
T10
0.00060
3 0.0179
The variance of the residuals in the forecast was obtained and plotted as shown by figure 5;
6
Figure 5: The sigma of the fitted projections
Conclusion
♣ The prices of the shares highly fluctuated, but the prices’ returns were constant over time.
6
♣ The models fitted were GARCH (1, 1) and ARFIMA (1, 0, 1), meaning the
autoregressive factor was 1 and the moving average also 1. The model was statistically
significant, fitting the returns of the prices of shares.
♣ The residuals of returns were normally distributed with a mean and variance of almost
zero.
♣ There exist no autocorrelation between the residual values of the returns.
♣ The projection for the next 10 days indicates that the company will make a profit over
that period and has no probability of a loss. The residuals of the projected model were
also normally distributed with set mean and sigma values.
References
6
Huskamp, H. A., Epstein, A. M., & Blumenthal, D. (2003). The impact of a national prescription
drug formulary on prices, market share, and spending: lessons for Medicare?. Health Affairs,
22(3), 149-158.
Blackman, S. C., Holden, K., & Thomas, W. A. (1994). Long-term relationships between
international share prices. Applied Financial Economics, 4(4), 297-304.
San-Juan-Rodriguez, A., Good, C. B., Heyman, R. A., Parekh, N., Shrank, W. H., & Hernandez,
I. (2019). Trends in prices, market share, and spending on self-administered disease-modifying
therapies for multiple sclerosis in Medicare Part D. JAMA neurology, 76(11), 1386-1390.
Choustova, O. (2008). Application of Bohmian mechanics to dynamics of prices of shares:
Stochastic model of Bohm–Vigier from properties of price trajectories. International Journal of
Theoretical Physics, 47(1), 252-260.
Wernerfelt, B. (1985). The dynamics of prices and market shares over the product life cycle.
Management Science, 31(8), 928-939.
charts and tables
Descriptive Statistics and Data Visualization
This study uses data from 2000 to 2021.
Table 1: Descriptive Statistics of the Crime
2
Serious crime ratestatisticspropertystatisticsviolentstatistics
Mean9.68Mean6.32Mean3.36
Standard Error0.527636238Standard Error0.444297198Standard Error0.116619
Median9.3Median5.9Median3.4
Mode9.3Mode#N/AMode3.1
Standard Deviation1.179830496Standard Deviation0.993478737Standard Deviation0.260768
Sample Variance1.392Sample Variance0.987Sample Variance0.068
Kurtosis3.646050089Kurtosis2.935394167Kurtosis-1.81228
Skewness1.742226206Skewness1.689538697Skewness0.163544
Range3.1Range2.5Range0.6
Minimum8.6Minimum5.5Minimum3.1
Maximum11.7Maximum8Maximum3.7
Sum48.4Sum31.6Sum16.8
Count5Count5Count5
As per 1000 residents, the serious crimes had mean of 9.68, median of 9.3 and mode of 9.3. The
crime on property had mean, median and mode values of 6.32, 5.9 and N/A respectively. The violent
crime rates had mean, median and mode of 3.36, 3.4 and 3.1 respectively.
From figure 1 below, it can be see clearly that there was a slight drop in all the crime rates in year 2010.
However, from 2020 to 2021, the crime rate is gradually increasing.
200020062010201920202021
Years
0
2
4
6
8
10
12
14
Crime and Incarceration
Serious crime rate (per 1,000 residents)
Serious crime rate, property (per 1,000 residents)
Serious crime rate, violent (per 1,000 residents)
Figure 1: Line Graph of Crime and Incarceration
3
0100200300400500200020062010201920202021UnitsYears
DevelopmentUnits authorized by new residential building permitsUnits issued new certificates of occupancy
Figure 2: Line Graph of the Development
As per figure 2 above, the units authorized by new residential building permits increased from
2000 up to the 2006. There was a decrease from 2006 to 2019, but then they hiked sharply up to 2021.
The units issued new certificates of occupancy decreased from 2000 up to 2006. The units increased
gradually from 2006 to 2010. From 2010 to 2019, the units issued new certificates of occupancy
increased steeply but decreased between 2019 and 2020.
200020062010201920202021
Years
$0
$200,000
$400,000
$600,000
$800,000
$1,000,000
$1,200,000
$1,400,000
$1,600,000
$1,800,000
Sales – Median Prices
Median sales price per unit, 1 family building (2021$)
Median sales price per unit, 2-4 family building (2021$)
Median sales price per unit, 5+ family building (2021$)
Median sales price per unit, condominium (2021$)
Figure 3: Line Graph of the Median Sale Prices
Figure 3 above shows median sale prices of different buildings. The median price of 1 family
building sharply increased between 2010 and 2019, and then dropped gradually between 2019 and 2020
before beginning to increasing gradually between 2020 and 2021. The price was highest in 2021.
4
The sale median price per unit for 2-4 family building increases gradually from 2006 to 2019. It
then decreases from 2019 to 2021. The price is highest in 2021. The median sale price for 5+ was low in
2010; from 2010 to 2019 it increased. From 2019 to 2021, the prices decrease gradually. The price is
highest in 2019. Considering the condominium, the prices dropped gradually between 2006 and 2010. It
then increased between 2010 and 2021. The price is highest in 2019.
200020062010201920202021
Year
0
100
200
300
400
500
600
700
Sale house price index
Index of housing price appreciation, 1 family building
Index of housing price appreciation, 2-4 family building
Index of housing price appreciation, 5+ family building
Index of housing price appreciation, condominium
Index of housing price appreciation, all property types
Figure 4: Line Graph of the Sale House Price Index
As per the figure 4 above, the index of 1 family building was highest in 2021. The index for the 2-
4 family building was highest in 2019. Considering the index of 5+ building, it was highest in 2020. The
index of the condominium was highest in 2021. The index of all property type was highest in 2019.
5
References
Smith, L. D., Best, L. A., Stubbs, D. A., Archibald, A. B., & Roberson-Nay, R. (2002). Constructing
knowledge: The role of graphs and tables in hard and soft psychology. American Psychologist, 57(10),
749.
Coll, R. A., Coll, J. H., & Thakur, G. (1994). Graphs and tables: a four-factor experiment. Communications
of the ACM, 37(4), 76-87.
Vessey, I. (1991). Cognitive fit: A theory‐based analysis of the graphs versus tables literature. Decision
sciences, 22(2), 219-240.
Ehrenberg, A. S. C. (1978). Graphs or tables. Journal of the Royal Statistical Society. Series D (The
Statistician), 27(2), 87-96.
step 6 re upload
NYU Furman center New York deals with the neighborhood data profiles, by viewing and
downloading the neighborhood indicators. They provide more information in-depth about the
demographics, housing market and the land use. Those indicators helps use in understanding the
local housing, demography trends, identify the customers’ needs and policy making.
Considering the financial district, in 2019, demographics show that there were 164, 514
individuals in the district. Out of the number, 13.8% were Asians, 3.4% were black, 11.4% were
Hispanic and 68.4% were White. That explains that majority of individuals were white and the
least were black.
There were different household groups in the financial district. The group with highest share in
2019 was $ 100, 001-$250,000 with 36.7%. Also, in 2020 it had largest share but at a slight
lower percentage of 34%.The poverty level in the financial district was 6.1% compared to the
16% of the citywide.
The real rent gross median point in the district increased from $2,330 in the year 2006 to $ 2,930
in 2019. In the year 2019, the percentage of the renter households that were at rent burden was
16.6% (most of the renters spent 50% of their income on the rent only). In the same year the
vacancy rate in the district was 4.7%.
The house ownership rate in the citywide was 31.9% in 2019 compared to 31.6% in the financial
district. There has been an increase of 6.3% in the house ownership rate in the neighborhood
since 2010. There were 9.3% mortgage that was introduced per 1000 1-4 property of the family
and the condominium units.
Security is very important in every surrounding. The serious crime rate was 16 crimes per 1000
residents in 2021 in comparison with 12.2 crimes per 1000 residents citywide.
Overall Relation to Dare Project
Drug abuse resistance education is a given program that seeks to prevent use of the controlled
drugs, violent behavior and membership in gangs. It’s my expectation that the crime rate in the
financial district would go down past the citywide. Also, it will be great if discrimination as per
race would end to reduce the stigma among the children.
STEP 5 RE UPLOAD
USA-Alaska Crime Data Comparison
Student name: Kieon Holder
Student ID: 30508632
Advantages and disadvantages of official crime data:
Official data is the set of numerical data and the government and also the agencies of government collect this data. Official data is a form of quantitative data at the secondary level and from surveys at a large scale, official data is collected. In sociological research official statistics is used. Whereas the non-official crime data numerically collect data from both the organization such as from public as well from private organizations.
Cost, as well as time, can also be saved by the official statistics and these can be accessed easily. Official statistics also give a good overview of the present society. Fairly the collection of data is due to stringent rules on a survey and it is easy for sociologists in identifying the trends.
Whereas there are some disadvantages of official data such as the official data being a social construction. The researcher cannot collect the data for which they are looking. As this is called secondary data and the researcher doesn’t know how the original data will be collected.
Insights of the crime data compared in a spreadsheet
In the given data, in Alaska in 2013 and 2014 agencies of the state submitted the data according to the definition of UCR but after the full transition in 2021 and changes and transition of Uniform Crime Reporting (UCR)to NIBRS ( National Incident-Based Reporting System) whereas for the agencies the deficiency of data and that is not completely transitioned.
Further, rape data in 2013 & 2014 in Alaska were collected and submitted by the state’s agencies according to the latest definition explained by the UCR whereas, in 2013, in the USA’s crime data, the data related to the rape cases were collected and submitted according to the both the latest definition provided and the original definition launched by the UCR. Moreover, in 1995, 168 murders and other kinds of homicides that came into effect due to the bomb blast of “The Alfred P. Murrah Federal Building in Oklahoma City” are also considered while making the national estimate. Further, in 2001, 2823 murders and other homicides were not included in the national data which came into effect due to an event that happened in September 2001 (Crime Data Explorer, n.d.).
Further, in the spreadsheet, I’ve shown the weapons used in the crimes made in USA and Alaska which shows that most crimes happened by using personal weapons. The second most used weapon in resulting crimes in the USA is the handgun whereas most crimes in Alaska resulted from using no weapon. This data also helps in identifying and reaching the murderer.
Related to the overall DARE project
This portion of the project will be very helpful in the completion of the overall DARE project as this will provide useful insights about the crimes that happened in the USA and Alaska. Further, this will also help me get further data required and use them in my further projects. I got a verified and trusted source of information to get data in more detail.
References:
Crime Data Explorer. (n.d.). Retrieved from https://crime-data-explorer.fr.cloud.gov/pages/explorer/crime/crime-trend