ABSTRACT
Bivariate correlation was carried out on the growth and yield characters of cowpea
(Vigna unguiculata [L] Walp) and pod yield and number of pods were observed to
be positively and significantly correlated with all the components assessed
(Number of Branches, Weight of defoliated leaves, Number of Pods, Length of
Pods, Shoot Dry Weight, Plant Height, Number of Leaves, Leaf Area Plant, and
Crop Growth Rates) except leaf area index and weight of defoliated leaves. The
various parameters exhibited significant interrelationship with one another. Of the
10 growth characters considered in the study, the Principal component analysis
shows the reduction of the dimension of the variates to 3 components and these
three components explain about 95.57% of the variation. All variables considered
shows normality with a linear line indicating that the data observed multivariate
normal distribution.
TABLE OF CONTENTS
Title page…………………………………………………………………… i
Declaration………………………………………………………………… iii
Certification………………………………………………………………… iv
Dedication…………………………………………………………………… v
Acknowledgment…………………………………………………………… vi
Abstract……………………………………………………………………… vii
Table of Content …………………………………………………………… viii
Chapter One: General Introduction………………………………………… 1
1.0 Introduction ………………………………………………………… 1
1.1 Statement of the Problem …………………………………………… 3
1.2 Objective of the Study …………………………………………….. 3
1.3 Significance of the Study…………………………………………… 4
1.4 Brief History of Cowpea ………………………………………… 4
1.5 Definition of Terms………………………………………………… 5
1.6 Contribution to Knowledge ………………………………………… 9
Chapter Two: Literature Review…………………………………………… 10
2.0 Introduction ………………………………………………………… 10
2.1 Normality…………………………………………………………… 10
2.2 Path Analysis……………………………………………………….. 11
2.3 Factor Analysis ……………………………………………………… 13
2.3.1 Principal Component Analysis ……………………………………… 13
x
2.3.2 Common Factor Analysis ………………………………………… 14
Chapter Three: Methodology……………………………………………… 16
3.0 Source of Data……………………………………………………… 16
3.1 Quartile-Quartile Plot……………………………………………… 16
3.2 Population Principal Components ………………………………… 17
3.3 Factors Analysis Theory…………………………………………… 19
3.4 Path Analysis ……………………………………………………… 20
Chapter Four: Results and Discussion……………………………………… 22
4.0 Introduction………………………………………………………… 22
4.1 Q-Q Plot to test for Normality……………………………………… 23
4.2 Principal Component Scree Plot Analysis………………………… 24
4.3 Principal Components Extraction………………………………… 25
4.4 Partitioning using Path Analysis…………………………………… 27
Chapter Five: Summary, Conclusion and Recommendation……………… 29
5.1 Summary and Conclusion……………………………………………. 29
5.2 Recommendation …………………………………………………………………….. 30
References ………………………………………………………… 31
Appendixes………………………………………………………… 34
CHAPTER ONE
GENERAL INTRODUCTION
1.0 Introduction
Multivariate analysis refers to any statistical technique used to analyze data
that arises from more than one variable. This essentially models reality where each
situation, product, or decision involves more than a single variable. Multivariate
analysis can be used for both spectral and non-spectral type of data. Spectra data
are essentially data derived by the use of spectroscopic instrument. This data
specifies variable with its properties and undoubtedly provide a great deal of
useful information about organic molecules. Spectral data are used by scientists to:
(i). Discover the chemical composition of materials by looking at the high (and
other kinds of electromagnetic radiation) and (ii). Identify and monitor the
production of products in factories.
The advent of gas chromatograph (GC) machines with non-destructive
detectors has made research in flavor more interactive, and some GCs now
incorporate sniffing ports to enable trained assessors to smell individual flavor
compounds as they are separated by the GC.
A Non-spectral data is essentially data collected from sensory and
environment. Any data that is collected from other sources, than spectroscopic and
chromatographic instruments is non-spectral data.
2
The principal component analysis (PCA) and cluster analysis (CA) are the
most common multivariate statistical methods in environmental studies. Principal
component analysis is widely used to reduce data dimensionality (Salawu, 2008)
and to extract a small number of latent factors for analyzing relationships among
the observed variables. If large differences exist in the standard deviations of
variables, PCA result will vary considerably depending on whether the correlation
or covariance matrix is used (Farnham et al, 2003).
Factor analysis (FA) is a statistical approach that can be used to analyze
interrelationship among a large number of variables and to explain these variables
in terms of their common underlying dimension (factors). The statistical approach
involving finding a way of condensing the information contained in a number of
original variables into a smaller set of dimension (factors) with a minimum loss of
information (Everith and Dunn, 2001).
Agronomy is the study of soils and plants, relating to the scientific study of
soils management, land cultivation and crop production. It describes plants
characteristics that are important during growth and development of a crop e.g
height and stem strength. The data on cowpea was collected and used for the
analysis.
3
1.1 Statement of the Problem
Over the years the measurable dimensions of agricultural trial are
numerous. This leads to cumbersome and time consuming tasks in the gathering
and analysis of data from agricultural experiments.
Efforts were made in this thesis to reduce the dimensionality of the
measurement from m to p, (p < m) such that maximum information will be
retained and large percent variation will be explained with the use of fewer
variables.
1.2 Objective of the Study
The specific objectives of the study include:-
To test the multivariate normality assumption on agronomic trial.
To study the relationship of the agronomic measured characteristics in
both the growth and yield parameter.
To determine the percent contribution of both direct and indirect
correlation in path analysis.
To reduce the dimensionality of the measured characteristics from m to
p; where p < m.
4
1.3 Significance of the Study
The Significance of this study is that the model constructed will provide:
The agriculturist with useful information on the most prominent
characteristics to use in the crop management.
The agriculturist with information on direct and indirect contributions of
the crop characteristics
Demonstration of the application of Path Analysis and Principal
Component Analysis in agricultural experiments to obtain useful
information.
1.4 Brief History of Cowpea
Cowpea (vigna Unguiculata (L) walp), common name for any of a genus of
leguminous herbs also knows as black-eyed pea. Cowpeas are sprawling or
twining herbs with triple leaves and pods 20 to 30 cm long enclosing several
kidney-shaped seeds. Cowpeas were cowpea originally native of Asia are now an
important forage and cover crops in Southern United State and Africa. Worldwide
Cowpea production has increased dramatically in the last 25 years (Adebayo and
Tukur, 1999).
Being a drought tolerant and warm weather crops, cowpea is well as
adapted to the drier regions of the tropics where other food legumes do not
perform well.
5
It also has the unique ability to fix atmospheric nitrogen through its nodules
and it grows well even in the soil with more than 85% sand and with less than
0.2% organic matter and low level of phosphorous, where soil PH is in the range
of 5.5 to 6.5 (Tuan and Philips, 1992).
Cowpea is consumed in many forms: young leaves, green pods, green seeds
are used as vegetables and dry seed are used in various food preparations. With
over 25% protein, in its seeds and tender leaves, cowpea is a major source of
protein, minerals and vitamins in the daily diets. Therefore Cowpea seed is valued
as a nutritional supplement to cereals and animals. Its seeds is a nutritious
component in the human diet as well as livestock feed with nutrient content as
follows: Protein, 24.8%, Fat, 1.9% Fiber, 6.3% Carbohydrate, 63.6%, Thiamine,
0.00074%, Riboflavin, 0.00042% and Niacin, 0.00281%, thus its positively
impacts on the health of women and children (Van and Gerike, 2000).
1.5 Definition of Terms and Concepts.
Variance (MS): The term MS (mean sum of square) is a short form of mean sum
of squares (MSS), which is calculated by dividing the sum of squares with its
degrees of freedom. One of the ‘S’was dropped in usage and its short form MS is
used. It is the most important measure of dispersion. In fact it has a very crucial
role in biostatistical analysis of experimental data and in the tests of significance.
The MS in the analysis of variance table is the same as variance. Mathematically,
it is square of standard deviation.
6
Coefficient of Variation (C.V): It is defined as the calculated standard deviation
expressed as a percentage of the mean.
Completely Randomized Design: The simplest experimental design used where
experimental material is uniform e.g. in the laboratory, green house, screen house,
growth chamber.
Continuous Variable: They are variables which can have any values between
certain limits e.g. yield, weight etc.
Number of Pods: The average number of pods in a stand of cowpea.
Length of Pods: The length of pods from the base to the tips of the pod. Shoot
Dry Weigh
Plant Height: It’s the measurement of cowpea stand from the ground to the
longest leaf tip or panicle.
Number of Leaves: This will be determined by counting the total number of fully
expanded leaves on the tagged plants and the average will be computed.
Leaf Area Plant: This will be determined by measuring the length and maximum
width of the leaves of five tagged plants and multiplied by a constant coefficient of
0.75 (Montgomery, 1911)
Leaf Area Index: This will be determined by measuring the length and breadth
from the widest portion of functional leaves of each leaf from the tagged plant
with a ruler. The product of length and breadth will then be multiplied by a factor,
7
0.75 from which area of individual leaves of the sampled plants will be obtained,
added and divided by the land area occupied by the tagged plants.
LAI =
P
A = leaf area per plant / Area subtended by plant
Where, A = Leaf area per plant (m2)
P = Ground area per plant (m2)
LAI = leaf area index.
Crop Growth Rates: Three plants will be randomly sampled from each sub-plot
for destructive sampling these will be oven dry at 700c for 24 hours using the
formula by Smith and Haddad, (2000). The unit of measurement is g cm -2 wk-1.
( ) 2 1
2 1
t t
CGR W W
1.x
W1 = dry matter weight at a sampling period
W2 = dry matter weight at the next sampling period
t1 = time at which w1 was taken
t2 = time at which w2 was taken
Path Model: A path model is a diagram relating intermediary, and dependent
variables. Single arrows indicate causation between exogenous or intermediary
variables and the dependent(s). Arrows also connect the error terms with the
respective endogenous variables. Double arrows indicate correlation between pairs
of exogenous variables.
8
Endogenous Variables: These are variables which form an internal part of the
system. Endogenous variables are those which do have incoming arrows.
Endogenous variables include intervening casual arrows in the path diagram. The
dependent variable(s) have only incoming arrows.
Exogenous Variables: These are variables which form an external part of the
system. Exogenous variables are those with no explicit causes (no arrows going to
them, other than the measurement error term). If exogenous variables are
correlated, this is indicated by a double-headed arrow connecting them.
Causal Paths: To a given variable include (1) the direct paths from arrows
leading to it, and (2) correlated paths from endogenous variables with others have
arrows leading to the given variable.
Path Coefficient: A path coefficient is a standardized regression coefficient (beta)
showing the direct effect of an independent variable on a dependent variable in the
path model. Thus when the model has two or more causal variables, path
coefficients are partial regression coefficients which measure the extent of effect
of one variable on another in the path model controlling for other prior variables,
using standardized data or a correlation matrix as input.
Effect Decomposition: Path coefficients may be used to decompose correlations
in the model into direct and indirect effects. This is based on the rule that in a
linear system, the total causal effect of variable i on variable j is the sum of the
values of all the paths from i to j.
9
Disturbance Terms: The residual error terms, reflect unexplained variance (the
effect of unmeasured variables) plus measurement error.
1.6 Contribution to Knowledge
This research x-rays the application of Principal Component Analysis and
Path Analysis in the reduction and partitioning of agricultural data.
Do you need help? Talk to us right now: (+234) 08060082010, 08107932631 (Call/WhatsApp). Email: [email protected].
IF YOU CAN'T FIND YOUR TOPIC, CLICK HERE TO HIRE A WRITER»