Abstract:
Real world social network is represented as a graph structure which has a
collection of non-empty set of nodes and set of connections of nodes. The nodes in a
social graph are people, and edges represent the connection between them. People in
the society are naturally grouped to form communities in their workplace, friendship
and family. The study of complex social network topology has triggered the interest of
many scientists in recent years. Some important features of a social network need to be
understood in terms of node density in cluster, degree of node, reachability, size of
network, geodesic distance and diameter. Thus, the idea of community within a social
network has emerged as an educational tool for social network analysis that lays the
foundation for some high-level initiatives. Finding community structure is the key to
research the structural components of relationships in the social network. Community
detection aims at grouping nodes based on the links between them to form strongly link
sub groups of graph from the whole graph.
Many community discovery techniques have been borrowed on inspired from
the problem of hierarchical clustering, graph partitioning in modern graph theory, as
well as the graph clustering or dense sub-graphs discovery problem in the graph mining
area. These algorithms were designed to reveal the mesoscopic nature of various
networks. It is not known how good the algorithm is in terms of computational time,
efficiency of these algorithms and accuracy, which remains still open. Community
detection in social network is one kind of the optimization problems because it cannot
get exact solution and only get an optimal solution. So, various types of community
detection algorithms attempt to capture the intuitive notation that nodes within the
group have higher intra-density and have lower inter-density with other nodes in the
other group.
Population-based nature inspired optimization algorithms have attracted
extensive research interests over the past decade. Researchers used these algorithms to
search the hidden communities in social networks. An enhanced discrete artificial bee
colony D-ABC algorithm is proposed for solving community detection problem in
social networks. In this thesis,some of the challenges and applications of social network
community discovery for analyzing network documents will be explored. The proposed
system focuses on undirected and unweighted social networks and considers the
connection of nodes for the community detection problem.
The proposed algorithm D-ABC is designed with an effective initialization
strategy with label-based solution representation and one way crossover operation of
genetic operators is used in the search strategy process. D-ABC selects modularity
function as the fitness function of each solution. The efficiency of searching process
can be improved by heuristic function which contain genetic algorithm operations. Most
of the real social network community detection problem do not know each community
size and numbers of community. D-ABC algorithm can also detect communities in
social network that does not know real community numbers and sizes. When it detects
the communities on a network structure, it uses neighbor nodes similarity measurement.
In this thesis, standard social datasets with ground-truth results and various scale real world networks without ground-truth from the Stanford Large Network Dataset
Collection are used. The performance of the proposed system is assessed by accuracy,
efficiency and effectiveness. The result indicates good quality with accurate
communities as well as their density improvement.