Exploring the followers and user of three banks they are Boubyan Bank, Kuwait Financial House and National Bank of Kuwait, through their Twitter accounts after that analysing each bank
We chose Twitter to explore and analyze the data, tweepy was the tool to gather three banks for our project which were Boubyan bank, KFH and NBK, also another source added to gather Net Profit for each bank from (Bayanati.com) website
We gathered three data for each bank:
Followers
Each of the team member got a specific bank, and each gathered 50,000 rows and 28 columns
We Combined the three dataframes under the column Bank Name
The total rows that was gathered is 150,000 rows but we foundout there were 150,028 rows
After clearing the 28 records the amount of the rows decresed
User Timeline
Each team member gathered 3,000+ rows and 15 columns
Tweepy gave a limit number of tweets to collect
It was a clean dataframe
Get User
Each bank got one row and 4 columns
contributors_enabled 0
created_at 11
description 103282
favourites_count 28
followers_count 28
friends_count 28
geo_enabled 28
id 28
id_str 28
lang 40
listed_count 28
location 105031
name 45
profile_background_color 28
profile_background_image_url 119970
profile_background_tile 28
profile_image_url 28
profile_text_color 28
profile_use_background_image 28
protected 28
screen_name 44
statuses_count 43
time_zone 139239
url 142275
utc_offset 139240
verified 45
Bank_Name 0
dtype: int64
- Pandas
- Altair
- Matplotlip
- Numpy
Variable | Variable definition | Data type | Missing data report | Report on the distribution of the data | level of analysis |
---|---|---|---|---|---|
contributors_enabled | Indicates that the user has an account with “contributor mode” enabled, allowing for Tweets issued by the user to be co-authored by another account. Rarely true | Boolean | 0 | the column consists of 8310 unique values | User Tweets |
created_at | The UTC datetime that the user account was created on Twitter | Categorical(object) | 0 | the column consists of 7116 unique values | Entities |
description | The user-defined UTF-8 string describing their account | Continues(float64) | 103271 | The minimum number 0.0 the maximum is 1617.0, and by ploting it Boubyan bank got the highest number of favourites on their tweets | User Tweets |
favourites_count | The number of Tweets this user has liked in the account’s lifetime | Continues(float64) | 0 | the column consists of 1 unique value, Boubyan Bank got the most likes on their tweets as showed in the Pie Chart | User Tweets |
followers_count | The number of followers this account currently has. | Continues(float64) | 0 | the column consists NaN | GEO |
friends_count | The number of users this account is following (AKA their “followings”) | continues(float64) | 0 | The column represents the the number of tweets | User Tweet |
geo_enabled | When true, indicates that the user has enabled the possibility of geotagging their Tweets. This field must be true for the current user to attach geographic data when using POST statuses / update | Boolean | 0 | The column consists 5038 unique values, and Boubyan bank has the highest replyes among the other banks using Pie Chart | Reply to screen name |
id | The integer representation of the unique identifier for this User. | Int64 | 0 | The column consists 3 unique values, the top language among 3 banks in the Arabic using Bar Chart | User Tweets |
id_str | The string representation of the unique identifier for this User | object | 0 | The minimum is 0.0 and the maximum is 692.0 number of retweets | Users |
lang | The BCP 47 code for the user’s self-declared user interface language. May or may not have anything to do with the content of their Tweets | Categorical(object) | 0 | The column consist 1 unique value, Boubyan bank exceeded among the other banks | Users |
listed_count | The number of public lists that this user is a member of | Continues(float64) | 0 | The column consists 8 unique values, Boubyan bank and NBK are mostly using Lithium Tech. and KFH moslty using Hootsuite | Users |
location | The user-defined location for this account’s profile. Not necessarily a location, nor machine-parseable. This field will occasionally be fuzzily interpreted by the Search service | Categorical(object) | 105003 | The column consists 9501 unique values | User Tweets |
name | The name of the user, as they’ve defined it. Not necessarily a person’s name. Typically capped at 20 characters, but subject to change | Categorical(object) | 7 | The column consists 74 unique values, the most active banks is KFH by using Bar Chart | User Tweets |
profile_background_color | The hexadecimal color chosen by the user for their background | Categorical(object) | 0 | 1862 unique values, by analysing url columns by using Bar Chart it shows that KFH has the highest number of urls | User Tweets |
profile_background_image_url | A HTTP-based URL pointing to the background image the user has uploaded for their profile | Object | 119942 | It contains 1226 unique values | User Tweets |
profile_background_tile | When true, indicates that the user’s profile_background_image_url should be tiled when displayed | Boolean | 0 | 4 unique values | Users Tweets |
profile_image_url | A HTTP-based URL pointing to the user’s profile image. | object | 0 | 68805 unique values | Users Tweets |
profile_text_color | The hexadecimal color the user has chosen to display text with in their Twitter UI | object | 0 | 289 unique values, most color code 333333 which is black | User Tweets |
profile_use_background_image | When true, indicates the user wants their uploaded background image to be used | Boolean | 0 | 16 unique values | Users Tweets |
protected | When true, indicates that this user has chosen to protect their Tweets | Boolean | 0 | It consists 15 unique values, the bank with the highest protection is Boubyan bank by 14.4% and the less is NBK by 6.4% | User Tweets |
screen_name | The screen name, handle, or alias that this user identifies themselves with. screen_names are unique but subject to change | object | 0 | 134665 unique values | Users Tweets |
statuses_count | The number of Tweets (including retweets) issued by the user | object | 0 | 16901.0 unique values | Users Tweets |
time_zone | A string describing the Time Zone this user declares themselves within | Categorical(object) | 139195 | 107 unique values, most using in Kuwait and Boubyan bank is the highest among other banks | Location |
url | A URL provided by the user in association with their profile. | object | 142247 | 6839 unique values | Users Tweets |
utc_offset | The offset from GMT/UTC in seconds | integer | 139195 | Minimum -39600.0000 and Maximum 50400.0000 | Location |
verified | When true, indicates that the user has a verified account. | Boolean | 0 | 2 unique values | Users Tweets |
Bank_Name | bank's name | object | 0 | 3 unique values, it represents the 3 banks Boubyan, KFH and NBK | Users Tweets |
Variable | Variable definition | Data type | Missing data report | Report on the distribution of the data | level of analysis |
---|---|---|---|---|---|
Created_at | Time and Date of the tweet by the bank | Categorical(object) | 0 | the column consists of 8310 unique values | User Tweets |
Entities | Contains hashtags, urls, media, user mentions, symbols from tweets | Categorical(object) | 0 | the column consists of 7116 unique values | Entities |
Favorite_count | Number of likes on users' tweets | Continues(float64) | 0 | The minimum number 0.0 the maximum is 1617.0, and by ploting it Boubyan bank got the highest number of favourites on their tweets | User Tweets |
Favorited | Whether the tweets in liked or not | Boolean | 0 | the column consists of 1 unique value, Boubyan Bank got the most likes on their tweets as showed in the Pie Chart | User Tweets |
GEO | Similar to coordinates, it represents the geographic location of this Tweet | Continues(float64) | 9676 | the column consists NaN | GEO |
ID | It represents a unique integer identifire of each tweet | continues(float64) | 0 | The column represents the the number of tweets | User Tweet |
In_reply_to_screen_name | The screen names of people replying to the users | Categorical(object) | 1688 | The column consists 5038 unique values, and Boubyan bank has the highest replyes among the other banks using Pie Chart | Reply to screen name |
Lang | Represents the language of the tweets | Categorical(object) | 0 | The column consists 3 unique values, the top language among 3 banks in the Arabic using Bar Chart | User Tweets |
Retweet_count | The number of times the tweet was retweeted | Continues(float64) | 0 | The minimum is 0.0 and the maximum is 692.0 number of retweets | Users |
Retweeted | Represents whether the tweets are retweeted or not by True and False | Boolean | 0 | The column consist 1 unique value, Boubyan bank exceeded among the other banks | Users |
Source | Different devices used to post the tweets and keep track with | Categorical(object) | 0 | The column consists 8 unique values, Boubyan bank and NBK are mostly using Lithium Tech. and KFH moslty using Hootsuite | Users |
Text | The text writting by the users to tweet any text, hashtags, using symboles, media and urls | Categorical(object) | 0 | The column consists 9501 unique values | User Tweets |
Hashtags | It is use to make some tweets easy to search by adding a (#) symbol on a word or short scentence | Categorical(object) | 9336 | The column consists 74 unique values, the most active banks is KFH by using Bar Chart | User Tweets |
URLs | Is the source of differnt websites for the banks or media websites | Categorical(object) | 7323 | 1862 unique values, by analysing url columns by using Bar Chart it shows that KFH has the highest number of urls | User Tweets |
Profit | The Net Profit of each bank | Continues(float64) | 0 | The minimum profit is -1.180 (in Million Dollars) and the maximum is 1.191 (in Million Dollars) | Banks and Year |
Bank Name | Created_at | Name | Location |
---|---|---|---|
KFH | 09/09/2009 | Kuwait Financial House | Kuwait |
NBK | 15/09/2009 | National Bank of Kuwait | Kuwait |
Boubyan | 02/02/2010 | Boubyan Bank | Kuwait |
Followers Description
followerlocation
followername
the same goes for the utc offset and the urls found for followers that boubyan has the highest in both these columns between the banks.
followerBG1
FollowersBG2
bg_uniqu
conclusion for this graph:
Bank Name | Created_at | Name | Location |
---|---|---|---|
KFH | 09/09/2009 | Kuwait Financial House | Kuwait |
NBK | 15/09/2009 | National Bank of Kuwait | Kuwait |
Boubyan | 02/02/2010 | Boubyan Bank | Kuwait |
Banks URLs
Notice:
boubyan hashtags
KFH hashtags
NBK hashtags
Boubyan Bank
KFH
NBK Notice:
Notice:
Boubyan Bank
KFH
NBK
Notice:
Definition:-
Hootsuite is a social media management platform, The system’s user interface takes the form of a dashboard, and supports social network integrations for Twitter, Facebook, etc
Lithium Technologies is a San Francisco-based provider of software that allows businesses to connect with their customers on social media and digital channels.