group 5 cluster analysis

Upload: mansi-gupta

Post on 07-Apr-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/6/2019 Group 5 Cluster Analysis

    1/13

    CLUSTER ANALYSIS

    PRESENTED BY:-

    Garima Anand(34)

    Sarabjeet kour(44)

    Supriya koul(59)

    Priyanshu Gupta(60)

  • 8/6/2019 Group 5 Cluster Analysis

    2/13

    CONCEPT

    Cluster analysis is a class of statistical techniques Cluster analysis is an exploratory data analysis tool.

    Cluster analysis sorts through the raw data.

    Acluster is a group of relatively homogeneous cases

    or observations. Cluster analysis, is an interdependence technique.

    Cluster analysis reduces the number of observations

    or cases.

    Example: A group of diners sharing the same table in a

    restaurant may be regarded as a cluster of people. In

    food stores items of similar nature, such as different

    types of meat or vegetables are displayed in the same or

    nearby locations.

  • 8/6/2019 Group 5 Cluster Analysis

    3/13

    HYPOTHETICAL EXAMPLE

    No. of vacation days

    I

    II

    III

    Expenditure on vacations(Rs.)

    Vacations by 15 individuals(A To O ).

    Three differentclusters ( I, II & III ) .

    .C .M

    .H

    .F .L.N

    .O .K

    .G .D

    .I

    .A

    .B .E

    .J

    To classify individuals or objects on the basis of their similarity

    or distance from each other . Distance is in inverse measure of

    similarity .

  • 8/6/2019 Group 5 Cluster Analysis

    4/13

    BASIC PROCEDURE OF CLUSTER ANALYSIS

    1. Formulate the problem .

    2. Select a distance measure :

    y Squared Euclidean distance .

    y Manhattan distance .

    y Chebyshev distance .

    y Mahalanobis (or correlation) distance .

    3. Select a clustering procedure .4. Decide on the number of clusters .

    5. Map and interpret clusters (draw conclusions ).

    6. Assess reliability and validity :

    y Repeat analysis but use different distance measure .

    y Repeat analysis but use different clustering technique .

    y Split the data randomly into two halves and analyze each partseparately .

    y Repeat analysis several times, deleting one variable each time

    y Repeat analysis several times, using a different order eachtime .

  • 8/6/2019 Group 5 Cluster Analysis

    5/13

    Clustering Methods

    Clustering methods are categorized as:

    Non-Hierarchicalclustering

    Hierarchicalclustering

  • 8/6/2019 Group 5 Cluster Analysis

    6/13

    Cont..

    Non-Hierarchicalclustering:

    first determine a cluster center,

    then group all objects that are within a certain distance

  • 8/6/2019 Group 5 Cluster Analysis

    7/13

    Examples

    o Sequential Threshold method - first determine a cluster center, then

    group all objects that are within a predetermined threshold from the

    center.Only One cluster is created at a time.

    o Parallel Threshold method - several cluster centers are determined

    simultaneously, then objects that are within a predetermined threshold

    from the centers are grouped.

    o Optimizing Partitioning method - first a non-hierarchical procedure isrun, then objects are reassigned so as to optimize an overall criterion

    o Centroid methods - clusters are generated that maximize the distance between the

    centers of clusters (a centroid is the mean value for all the objects in the cluster)

    o Variance methods - clusters are generated that minimize the within-cluster

    variance

  • 8/6/2019 Group 5 Cluster Analysis

    8/13

    Cont..

    Hierarchicalclustering

    objects are organized into an hierarchical

    structure as part of the procedure

    Examples:a) Divisiveclustering - start by treating all objects as if they are

    part of a single large cluster, then divide the cluster into smaller

    and smaller clusters.

    b) Agglomerativeclustering - start by treating each object as a

    separate cluster, then group them into bigger and bigger clustersexamples:

  • 8/6/2019 Group 5 Cluster Analysis

    9/13

    Cont..

    c) Linkage methods cluster of objects are based on the distance

    between them

    examples:

    o Single Linkage method - cluster objects based on the minimum

    distance between them (also called the nearest neighbour rule)

    o Complete Linkage method - cluster objects based on the maximum

    distance between them (also called the furthest neighbour rule)

    o Average Linkage method - cluster objects based on the average

    distance between all pairs of objects (one member of the pair must be from

    a different cluster)

  • 8/6/2019 Group 5 Cluster Analysis

    10/13

    ADVANTAGES Of CLUSTER

    ANALYSIS IN MARKETING:-

    Market segmentation

    Buyer Behavior

    Development Of New Product

    Reduce Number Of Test Markets

  • 8/6/2019 Group 5 Cluster Analysis

    11/13

    Disadvantage:-

    Lack Of Specificity

    Lack Specific Technique

    Time Consuming

  • 8/6/2019 Group 5 Cluster Analysis

    12/13

  • 8/6/2019 Group 5 Cluster Analysis

    13/13