Skip to content

lostdevfound/GetSocialNetwork

Repository files navigation

GetSocialNetwork

getsocialnetwork

A framework to get data from a social network. Python 2.7.10

Currently the supported platform is LinkedIn. The framework supports only English language. The module doesn't need any external libs. Case study on my blog: http://renderstory.com/linkedinstats/

Usage:

# Quick start:
# Create a file in the module directory and type in the following code
# import all project libraries.

from linkedInStats import LinkedInStats
from socialNetwork import LinkedIn

# initiazlie LinkedInStats class to login into the network.
lstats = LinkedInStats('[email protected]', 'password')

# first we need to create the database of all connections. 
# The framework will download all contacts' profile pages to the local drive for further processing
# Once the download is over, you'll get a dataBase.json file which will contain various entries about 1st level contacts.
dataBase, errorProfiles = lstats.createDataBase(-1,4)

# then we can call other methods which output txt file stats in the root of the module.
lstats.workOverTime(2008,2015)
lstats.experienceStats()
lstats.locationStats()
lstats.companyStats()
lstats.workStats()

The socialNetwork.py has data scraping methods.

Usage:

from socialNetwork import LinkedIn
# initialize the class.
link = LinkedIn('[email protected]','password')
# Get profile n level connection by specifying name and lastname.
profileData = l.get2ndConnections('Name','Lastname','fileDir')

The txt files created by LinkedInStats methods can be easily graphed in Excel and Pandas.

mystats

program pipeline

Known bugs and restrictions. 0. Since LinkedIn periodically updates its website api and html content some of GetSocialNetwork features might not work

  1. Sometimes, regex expressions do not grab the proper information. This results in minor inaccuracies in stats.
  2. Languages other than English are not supported. Thus, the profiles which use other languages may cause errors.
  3. The speed of scraping is deliberately limited, so LinkedIn doesn't think it is a bad bot.
  4. The recursive methods getProfileConnections and getAllConnections should be used with care!
  5. The overuse and abuse of getProfileConnections and getAllConnections may cause issues with LinkedIn.

About

A framework to get the data from social networks

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages