Instagramy
Python Package for Instagram Without Any external dependencies
Scrape Instagram Users Information, Posts data, Hashtags and Locations data. This Package scrapes the user's recent posts with some information like likes, comments, captions and etc. No external dependencies.
Features
- It scrapes most of the data of Instagram user, hastags, Posts and Location
- You can use this package with login or without login
- Download Instagram post and User profile picture
- Have some plugins for Data analysis
- No External dependencies
- Having caching Function
- Lightweight
- Easy to Use
Download
Installation
pip install instagramy
Upgrade
pip install instagramy --upgrade
Sample Usage
Getting Session Id of Instrgram
For Login into Instagram via instagramy session id is required. No username or password is Needed. You must be login into Instagram via Browser to get session id
- Login into Instagram in default webbrowser
- Move to Developer option
- Copy the sessionid
- Move to storage and then to cookies and copy the sessionid (Firefox)
- Move to Application and then to storage and then to cookies and copy the sessionid (Chrome)
Note: Check for session id frequently, It may be changed by Instagram
Instagram User details
Class InstagramUser
scrape some of the information related to the user of the Instagram
>>> from instagramy import InstagramUser
>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"
>>> user = InstagramUser('google', sessionid=session_id)
>>> user.is_verified
True
>>> user.biography
'Google unfilteredโsometimes with filters.'
>>> user.user_data # More data about user as dict
If you get the data of the user onetime, instagramy store the data as cache file for avoid the error. you can get the data from cache also. Don't provide the sessionid.
>>> from instagramy import InstagramUser
>>> user = InstagramUser('google', from_cache=True)
>>> user.is_verified
True
It is opt of all classes InstagramUser
, InstagramHashTag
and InstagramPost
.
Show all Properties
- biography
- connected_fb_page
- followed_by_viewer
- follows_viewer
- fullname
- has_blocked_viewer
- has_country_block
- has_requested_viewer
- is_blocked_by_viewer
- is_joined_recently
- is_private
- is_verified
- no_of_mutual_follower
- number_of_followers
- number_of_followings
- number_of_posts
- other_info
- posts
- posts_display_urls
- profile_picture_url
- requested_by_viewer
- restricted_by_viewer
- username
- website
InstagramUser.user_data
has more data other than defined as Properties
Instagram Hashtag details
Class InstagramHashTag
scrape some of the information related to the hash-tag of the Instagram
you can also set your sessionid as env variable
$ export SESSION_ID="38566737751%3Ah7JpgePGAoLxJe%er40q"
>>> import os
>>> from instagramy import InstagramHashTag
>>> session_id = os.environ.get("SESSION_ID")
>>> tag = InstagramHashtag('google', sessionid=session_id)
>>> tag.number_of_posts
9556876
>>> tag.tag_data # More data about hashtag as dict
Show all Properties
- number_of_posts
- posts_display_urls
- profile_pic_url
- tagname
- top_posts
InstagramHashTag.tag_data
has more data other than defined as Properties
Instagram Post details
Class InstagramPost
scrape some of the information related to the particular post of Instagram. It takes the post id as the parameter. You can get the post id from the URL of the Instagram posts from the property of InstagramUser.posts
. or InstagramHagTag.top_posts
>>> from instagramy import InstagramPost
>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"
>>> post = InstagramPost('CLGkNCoJkcM', sessionid=session_id)
>>> post.author
'ipadpograffiti'
>>> post.number_of_likes
1439
>>> post.post_data # More data about post as dict
Show all Properties
- author
- caption
- display_url
- get_json
- number_of_comments
- number_of_likes
- post_source
- text
- type_of_post
- upload_time
InstagramPost.post_data
has more data other than defined as Properties
Instagram Location details
Class InstagramLocation
scrape some of the information and posts related to the given Location . It takes the location id and slug as the parameter. You can get the location id and slug from the URL of the Instagram Location or from the property of InstagramPost.location.id
and InstagramPost.location.slug
.
>>> from instagramy import InstagramPost
>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"
>>> post = InstagramPost('CLGkNCoJkcM', sessionid=session_id)
>>> location_id, slug = post.location.id, post.location.slug
>>> from Instagramy import InstagramLocation
>>> location = InstagramLocation(location_id, slug, session_id)
>>> location.latitude
28.6139
>>> location.longitude
77.2089
>>> location.address
{'street_address': 'T2, Indira Gandhi International Airport', 'zip_code': '', 'city_name': 'New Delhi', 'region_name': '', 'country_code': 'IN', 'exact_city_match': False, 'exact_region_match': False, 'exact_country_match': False}
you can also get the location id and slug from the instagram url
https://www.instagram.com/explore/locations/977862530/mrc-nagar
https://www.instagram.com/explore/locations/<location_id>/<slug>
Show all Properties
- address
- id
- latitude
- location_data
- longitude
- name
- number_of_posts
- phone
- profile_pic_url
- sessionid
- slug
- top_posts
- url
- viewer
- website
InstagramLocation.location_data
has more data other than defined as Properties
Plugins
Instagramy has some plugins for ease
Plugins for Data Analyzing
- analyze_users_popularity
- analyze_hashtags
- analyze_user_recent_posts
>>> import pandas as pd
>>> from instagramy.plugins.analysis import analyze_users_popularity
>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"
>>> teams = ["chennaiipl", "mumbaiindians",
"royalchallengersbangalore", "kkriders",
"delhicapitals", "sunrisershyd",
"kxipofficial"]
>>> data = analyze_users_popularity(teams, session_id)
>>> pd.DataFrame(data)
Usernames Followers Following Posts
0 chennaiipl 6189292 194 5646
1 mumbaiindians 6244961 124 12117
2 royalchallengersbangalore 5430018 59 8252
3 kkriders 2204739 68 7991
4 delhicapitals 2097515 75 9522
5 sunrisershyd 2053824 70 6227
6 kxipofficial 1884241 67 7496
Plugins for Downloading Posts
- download_hashtags_posts
- download_post
- download_profile_pic
>>> import os
>>> from instagramy.plugins.download import *
>>> session_id = os.environ.get('SESSION_ID')
>>> download_profile_pic(username='google', sessionid=session_id, filepath='google.png')
>>> download_post(id="ipadpograffiti", sessionid=session_id, filepath='post.mp4')
>>> download_hashtags_posts(tag="tamil", session_id=session_id, count=2)
Use Without Login
You can use this package without login. Sessionid is not required but it may rise RedirectionError
error after four to five requests.
>>> from instagramy import *
>>> user = InstagramUser('google')
>>> user.fullname
'Google'
>>> tag = InstagramHashTag('python')
>>> tag.tag_data
Caching Feature
from version 4.3
, Added the new feature that is caching the required data. If you get the data of the user onetime, instagramy store the data as cache json file for avoid the error. you can get the data from cache also. Don't need to provide the sessionid. Instead of sessionid add the optional parameter from_cache=True
.
>>> from instagramy import InstagramUser
>>> user = InstagramUser('google', from_cache=True)
>>> user.is_verified
True
It is opt of all classes InstagramUser
, InstagramHashTag
, InstagramPost
and InstagramLocation
.
Clear all Caches created by instagramy in current dir by
>>> from instagramy.core.cache import clear_caches
>>> clear_caches() # clear all caches of instagramy
List of all Cache files created by instagramy in current dir
>>> from instagramy import list_caches
>>> list_caches() # list all caches of instagramy
Sample Scripts
Getting Email address and phone number
user = InstagramUser('username')
email, phone_number = user.user_data['business_email'], user.user_data['business_phone_number']
โ๏ธ Important Notes
- Don't send huge request to Instagram with sessionid, Instagram may ban you.
- You can use this package without sessionid (Login). But it may
RedirectionError
after four to five requests. - class
Viewer
provide the data about currently logged in user. - Check for session id frequently, It may be changed by Instagram
- If code execution is never gets completed, check and change your session id and try again.
- Don't provide the wrong session_id.
InstagramUser.user_data
,InstagramPost.post_data
,InstagramHashtag.tag_data
andInstagramLocation.location_data
which is pythondict
has more and more data other than defined asProperties
.- This Package does not scrap all the posts from an account, the limit of the post only 12 (For non-private account)
- This Package not scrap all the posts of given hash-tag and location it only scrapes the top 60 - 72 posts.
Disclaimer
If you send the huge request to the Instagram with session id Instagram may ban you. I am not responsible for any misuse or damage caused by this program.
License
Contributions
Contributions are Welcome. Feel free to report bugs in issue and fix some bugs by creating pull requests. Comments, Suggestions, Improvements and Enhancements are always welcome. Let disscuss about it Here.