focused-clustering

Focused Clustering and Outlier Detection in Large Attributed Graphs

Bryan Perozzi

Disclaimers

  1. This is research-quality code, so it is probably more useful as an example than as a product.
  2. The distance metric learning is based on original code from Eric Xing, available here.

Implementation Overview

There are two programs. The first is a MATLAB script that learns a distance metric and reweights the input graph. The second is a Java program that extracts communities and outliers from the reweighted graph.
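To make the reweighting step concrete, here is a minimal Python sketch. It is not the repository's MATLAB code. It assumes a diagonal learned metric (a hypothetical per-attribute weight vector `beta`) and assumes each edge is weighted as `1 / (1 + distance)`, so that edges between nodes with similar attributes get weights near 1. The function names and toy data are illustrative only.

```python
import math


def mahalanobis_diag(x, y, beta):
    """Distance sqrt(sum_k beta[k] * (x[k] - y[k])**2) under a diagonal metric."""
    return math.sqrt(sum(b * (a - c) ** 2 for a, c, b in zip(x, y, beta)))


def reweight_edges(edges, features, beta):
    """Return {(u, v): weight}, where weight = 1 / (1 + distance(u, v)).

    The 1 / (1 + d) form is an assumption made for this sketch.
    """
    weighted = {}
    for u, v in edges:
        d = mahalanobis_diag(features[u], features[v], beta)
        weighted[(u, v)] = 1.0 / (1.0 + d)
    return weighted


# Toy data (hypothetical): three nodes with two attributes each.
features = {0: [1.0, 5.0], 1: [1.0, 9.0], 2: [4.0, 5.0]}
edges = [(0, 1), (0, 2)]

# Pretend the learned metric only cares about attribute 0.
beta = [1.0, 0.0]

weights = reweight_edges(edges, features, beta)
for edge, w in weights.items():
    print(edge, w)
```

With this `beta`, nodes 0 and 1 agree on the only attribute that matters, so their edge keeps full weight, while the edge between nodes 0 and 2 is down-weighted.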

Running

An example batch file, focusco.bat, shows how to use the MATLAB program from the command line. (It will eventually run the whole pipeline.) It can be run like so:

>focusco.bat example.edges example.features example.similar

This produces the file focusco.out.weighted.edges.

Installation

Requirements

  1. A recent version of MATLAB
  2. Java 6+

Setup

The only required step should be to build the Java program, e.g. with Maven:

  1. $ cd java_src
  2. $ mvn clean install