{ "cells": [ { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Online Data Collection\n", "- Shared\n", " - API based\n", " - Files and databases\n", "- Web Scraping\n", "- Manual\n", " - Surveys\n", " - Observation (When desperate)\n", " " ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "subslide" } }, "source": [ "# Online Data Collection Cont.\n", "\n", "- Data is unlikely to be organized as rows and columns\n", "- You have to organize the data manually and construct a DataFrame\n", "- Need to determine what your variables will be\n", "- Need to be very careful with level of analysis here" ] }, { "cell_type": "markdown", "metadata": { "slideshow": { "slide_type": "slide" } }, "source": [ "# Constructing Data Frames Manually\n", "\n", "## Option 1: A List of Dictionaries" ] }, { "cell_type": "code", "execution_count": 36, "metadata": { "collapsed": true }, "outputs": [], "source": [ "import pandas as pd\n", "\n", "record1 = {\"name\":\"mohammed\", \"id\":1234, \"age\":45}\n", "record2 = {\"name\":\"Ali\", \"id\":1235, \"age\":35}\n", "record3 = {\"name\":\"Sara\", \"id\":1236, \"age\":25}\n", "\n", "list_of_records = [record1, record2, record3]" ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": true, "slideshow": { "slide_type": "subslide" } }, "outputs": [], "source": [ "# Here is another way to write the previous code\n", "\n", "list_of_records = [\n", " {\"name\":\"mohammed\", \"id\":1234, \"age\":45},\n", " {\"name\":\"Ali\", \"id\":1235, \"age\":35},\n", " {\"name\":\"Sara\", \"id\":1236, \"age\":25},\n", "]" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "slideshow": { "slide_type": "subslide" } }, "outputs": [ { "data": { "text/html": [ "
| | age | id | name |
|---|---|---|---|
| 0 | 45 | 1234 | mohammed |
| 1 | 35 | 1235 | Ali |
| 2 | 25 | 1236 | Sara |
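The table above is what a list of dictionaries looks like once it is turned into a DataFrame. The cell that produced it is not visible here, so the following is a minimal sketch, assuming it was a plain `pd.DataFrame(...)` call on `list_of_records`; the variable name `df` is an assumption.

```python
import pandas as pd

list_of_records = [
    {"name": "mohammed", "id": 1234, "age": 45},
    {"name": "Ali", "id": 1235, "age": 35},
    {"name": "Sara", "id": 1236, "age": 25},
]

# Each dictionary becomes one row; the dictionary keys become the column names.
df = pd.DataFrame(list_of_records)  # `df` is an assumed name
print(df)
```

Recent pandas releases keep the key order (`name`, `id`, `age`). The alphabetical column order in the table above suggests it was produced with an older release.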
| | 0 | 1 | 2 |
|---|---|---|---|
| 0 | mohammed | 1234 | 45 |
| 1 | Ali | 1235 | 35 |
| 2 | Sara | 1236 | 25 |
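A table with the column labels `0`, `1`, `2` is what pandas produces when the rows carry no names, for example a list of lists. The originating cell is not shown, so this is only a sketch under that assumption; `list_of_lists` is a hypothetical name.

```python
import pandas as pd

# Assumed input: one inner list per record, values in a fixed order.
list_of_lists = [
    ["mohammed", 1234, 45],
    ["Ali", 1235, 35],
    ["Sara", 1236, 25],
]

# Without column names, pandas labels the columns 0, 1, 2, ...
df = pd.DataFrame(list_of_lists)
print(df.columns.tolist())  # [0, 1, 2]
```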
| | name | id | age |
|---|---|---|---|
| 0 | mohammed | 1234 | 45 |
| 1 | Ali | 1235 | 35 |
| 2 | Sara | 1236 | 25 |
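Here the same rows carry proper headers. One way to get this result, sketched under the assumption that the rows are still a list of lists, is to pass the names through the `columns` argument of the DataFrame constructor.

```python
import pandas as pd

list_of_lists = [  # hypothetical name for the unnamed rows
    ["mohammed", 1234, 45],
    ["Ali", 1235, 35],
    ["Sara", 1236, 25],
]

# `columns` supplies the names, in the same order as the values in each row.
df = pd.DataFrame(list_of_lists, columns=["name", "id", "age"])
print(df)
```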
| | name | id | age |
|---|---|---|---|
| 0 | mohammed | 1234 | 45 |
| 1 | Ali | 1235 | 35 |
| 2 | Sara | 1236 | 25 |
| 3 | zaid | 1237 | 33 |
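The fourth row (`zaid`) shows a record being added after the first three. How the original cell did this is not visible. A simple pattern that works across pandas versions, shown here as an assumption rather than the notebook's own code, is to append to the plain Python list and build the DataFrame once at the end.

```python
import pandas as pd

records = [  # hypothetical name
    ["mohammed", 1234, 45],
    ["Ali", 1235, 35],
    ["Sara", 1236, 25],
]

# Collect new records in the list first ...
records.append(["zaid", 1237, 33])

# ... then construct the DataFrame from the complete list.
df = pd.DataFrame(records, columns=["name", "id", "age"])
print(df)
```

Growing a list and converting once tends to be cheaper than growing a DataFrame row by row, which matters when records arrive one at a time from an API or a scraper.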
| | description | name | url |
|---|---|---|---|
| 0 | **Grit is no longer maintained. Check out libg... | grit | https://api.github.com/repos/mojombo/grit |
| 1 | Merb Core: All you need. None you don't. | merb-core | https://api.github.com/repos/wycats/merb-core |
| 2 | The Rubinius Language Platform | rubinius | https://api.github.com/repos/rubinius/rubinius |
| 3 | Ruby process monitor | god | https://api.github.com/repos/mojombo/god |
| 4 | Awesome JSON | jsawesome | https://api.github.com/repos/vanpelt/jsawesome |
| 5 | A JavaScript BDD Testing Library | jspec | https://api.github.com/repos/wycats/jspec |
| 6 | Unmaintained. Sorry. | exception_logger | https://api.github.com/repos/defunkt/exception... |
| 7 | include Enumerable — Unmaintained | ambition | https://api.github.com/repos/defunkt/ambition |
| 8 | Generates common user authentication code for ... | restful-authentication | https://api.github.com/repos/technoweenie/rest... |
| 9 | Treat an ActiveRecord model as a file attachme... | attachment_fu | https://api.github.com/repos/technoweenie/atta... |
| 10 | SUPER OLD STUFF | microsis | https://api.github.com/repos/Caged/microsis |
| 11 | psuedo s3 protocol for mozilla browsers | s3 | https://api.github.com/repos/anotherjesse/s3 |
| 12 | The solution for tabitus of the browser | taboo | https://api.github.com/repos/anotherjesse/taboo |
| 13 | firefox trac integration | foxtracs | https://api.github.com/repos/anotherjesse/foxt... |
| 14 | Flash photo widget prototype - hacked at last ... | fotomatic | https://api.github.com/repos/anotherjesse/foto... |
| 15 | A realtime, OpenGL graphing library for Ruby | glowstick | https://api.github.com/repos/mojombo/glowstick |
| 16 | None | starling | https://api.github.com/repos/defunkt/starling |
| 17 | Merb More: The Full Stack. Take what you need;... | merb-more | https://api.github.com/repos/wycats/merb-more |
| 18 | A very fast & simple Ruby web server | thin | https://api.github.com/repos/macournoyer/thin |
| 19 | Rails RESTful controller abstraction plugin. | resource_controller | https://api.github.com/repos/jamesgolick/resou... |
| 20 | Markaby patched to run on rails 2.0.2 | markaby | https://api.github.com/repos/jamesgolick/markaby |
| 21 | None | enum_field | https://api.github.com/repos/jamesgolick/enum_... |
| 22 | Subtlety: SVN => RSS, hAtom => Atom | subtlety | https://api.github.com/repos/defunkt/subtlety |
| 23 | Zippy lil’ zipcode lib. | zippy | https://api.github.com/repos/defunkt/zippy |
| 24 | Ghost from Christmas past. Unmaintained. | cache_fu | https://api.github.com/repos/defunkt/cache_fu |
| 25 | A ruby library to inexpensively emit runtime ... | phosphor | https://api.github.com/repos/KirinDave/phosphor |
| 26 | (offically at github.com/sinatra/sinatra) Clas... | sinatra | https://api.github.com/repos/bmizerany/sinatra |
| 27 | Prototype/Javascript wrapper for the Google Se... | gsa-prototype | https://api.github.com/repos/jnewland/gsa-prot... |
| 28 | Syncs one directory to another (example: a git... | duplikate | https://api.github.com/repos/technoweenie/dupl... |
| 29 | Proof of concept Lazy-Loading for ActiveRecord... | lazy_record | https://api.github.com/repos/jnewland/lazy_record |
| ... | ... | ... | ... |
| 70 | Rorem is a random data generator | rorem | https://api.github.com/repos/jnicklas/rorem |
| 71 | Simple tool to help track vendor branches in a... | braid | https://api.github.com/repos/cristibalan/braid |
| 72 | UploadColumn is no longer maintained, check ou... | uploadcolumn | https://api.github.com/repos/jnicklas/uploadco... |
| 73 | Ruby on Rails TextMate bundle [master branch i... | ruby-on-rails-tmbundle | https://api.github.com/repos/simonjefford/ruby... |
| 74 | OUTDATED mirror of Rack's darcs repository, us... | rack-mirror | https://api.github.com/repos/chneukirchen/rack... |
| 75 | (experimental) Mirror of the coset darcs repos... | coset-mirror | https://api.github.com/repos/chneukirchen/cose... |
| 76 | JavaScript Unit Test TextMate Bundle [for prot... | javascript-unittest-tmbundle | https://api.github.com/repos/drnic/javascript-... |
| 77 | Engine Yard specific capistrano recipes | eycap | https://api.github.com/repos/engineyard/eycap |
| 78 | basic darcsum feelalike for Git | gitsum | https://api.github.com/repos/chneukirchen/gitsum |
| 79 | Sequel::Model (No longer working on this project) | sequel-model | https://api.github.com/repos/wayneeseguin/sequ... |
| 80 | Ruby process monitor | god | https://api.github.com/repos/kevinclark/god |
| 81 | blerb running on merb-core | blerb-core | https://api.github.com/repos/hornbeck/blerb-core |
| 82 | utilities for implementing a modified pre-orde... | django-mptt | https://api.github.com/repos/brosner/django-mptt |
| 83 | a Scheme written in Ruby, but implemented on t... | bus-scheme | https://api.github.com/repos/technomancy/bus-s... |
| 84 | Useful pieces of JavaScript. Some old, some new. | javascript-bits | https://api.github.com/repos/Caged/javascript-... |
| 85 | Ruby parsers for some Adobe file formats. | groomlake | https://api.github.com/repos/Caged/groomlake |
| 86 | Easy and customizable generation of forged data. | forgery | https://api.github.com/repos/sevenwire/forgery |
| 87 | Ambition adapter for Sphinx | ambitious-sphinx | https://api.github.com/repos/technicalpickles/... |
| 88 | I suppose it's a document database. Or a tuple... | soup | https://api.github.com/repos/lazyatom/soup |
| 89 | Ruby on Rails | rails | https://api.github.com/repos/josh/rails |
| 90 | An Io web framework of sorts | backpacking | https://api.github.com/repos/cdcarter/backpacking |
| 91 | A Capistrano extension for managing and runnin... | capsize | https://api.github.com/repos/jnewland/capsize |
| 92 | Starling Message Queue | starling | https://api.github.com/repos/bs/starling |
| 93 | The Atom Protocol Exerciser | ape | https://api.github.com/repos/sr/ape |
| 94 | Collective Idea's Awesomeness. A collection o... | awesomeness | https://api.github.com/repos/collectiveidea/aw... |
| 95 | Audited (formerly acts_as_audited) is an ORM e... | audited | https://api.github.com/repos/collectiveidea/au... |
| 96 | Simple geocoding for Active Record models | acts_as_geocodable | https://api.github.com/repos/collectiveidea/ac... |
| 97 | an Active Record plugin that makes it easier t... | acts_as_money | https://api.github.com/repos/collectiveidea/ac... |
| 98 | None | calendar_builder | https://api.github.com/repos/collectiveidea/ca... |
| 99 | When Active Record objects are saved from a fo... | clear_empty_attributes | https://api.github.com/repos/collectiveidea/cl... |

100 rows × 3 columns
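The repository table above is typical of API-based collection: the service returns JSON with many fields per record, and you decide which ones become variables. The request itself is not shown in this part of the notebook, so the sketch below skips the network call and parses a small hand-written payload instead. The payload layout and the `fork` values are illustrative assumptions; the three repositories and their `description`, `name`, and `url` values are copied from the table.

```python
import json
import pandas as pd

# Stand-in for the text of an API response (assumption: a JSON array with one
# object per repository, carrying more fields than we want to keep).
payload = """
[
  {"name": "merb-core", "description": "Merb Core: All you need. None you don't.",
   "url": "https://api.github.com/repos/wycats/merb-core", "fork": false},
  {"name": "rubinius", "description": "The Rubinius Language Platform",
   "url": "https://api.github.com/repos/rubinius/rubinius", "fork": false},
  {"name": "starling", "description": null,
   "url": "https://api.github.com/repos/defunkt/starling", "fork": false}
]
"""

repos = json.loads(payload)  # list of dictionaries, one per repository

# Keep only the fields that will be variables in the analysis.
rows = [
    {"description": r["description"], "name": r["name"], "url": r["url"]}
    for r in repos
]

df = pd.DataFrame(rows, columns=["description", "name", "url"])
print(df)
```

A JSON `null` becomes `None` in Python, which is why some descriptions in the table read `None`.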
| | created_at | description | favourites_count | followers_count | location | protected | screen_name | statuses_count |
|---|---|---|---|---|---|---|---|---|
| 0 | 2017-11-18 15:55:35 | | 0 | 0 | | False | tninhszizuhqd41 | 0 |
| 1 | 2017-11-12 20:13:57 | | 0 | 0 | | False | jvfTv4KIBgzZUla | 0 |
| 2 | 2017-11-18 15:38:16 | | 0 | 0 | | False | mRgZgj6rETxAP0u | 0 |
| 3 | 2017-11-18 15:22:06 | | 1 | 4 | | False | 8Dlb6cTzeNFtulj | 0 |
| 4 | 2012-06-14 05:37:50 | | 1996 | 3 | kuwait | True | Q8_lanko | 0 |
| 5 | 2017-11-16 18:33:53 | | 0 | 6 | الاحمدى, دولة الكويت | False | saidfaw79658353 | 0 |
| 6 | 2017-08-31 14:20:41 | | 0 | 2 | | False | Oussama35543613 | 0 |
| 7 | 2017-11-18 14:54:20 | | 0 | 0 | | False | sherlysimpson11 | 0 |
| 8 | 2017-11-13 18:53:38 | | 48 | 27 | دولة الكويت | False | abdulla13469511 | 3 |
| 9 | 2016-10-18 11:05:30 | | 82 | 50 | Kuwait | True | AmrKhal42267963 | 52 |
| 10 | 2017-11-17 05:45:17 | محدش مرتاح | 7 | 3 | جليب الشيوخ, دولة الكويت | False | etJblAerk4Qyd0w | 6 |
| 11 | 2017-10-27 09:59:52 | | 263 | 11 | الفروانيه, دولة الكويت | False | u9gnR8XZ2qIRdLO | 60 |
| 12 | 2017-10-09 14:47:19 | I LIKE BATMAN AND TURTULS | 34 | 4 | دولة الكويت | False | FAHAD_ALDEEN | 11 |
| 13 | 2017-10-10 05:03:25 | | 0 | 0 | | False | gehad_kretnah | 0 |
| 14 | 2017-11-18 13:41:41 | | 1 | 1 | | False | Jamal19838614 | 0 |
| 15 | 2017-11-18 13:34:15 | | 0 | 1 | | False | vennieackarie22 | 0 |
| 16 | 2017-11-17 10:28:48 | Be Brave 🦋 | 6 | 5 | | False | 89Khuloud | 13 |
| 17 | 2017-11-18 12:05:39 | بتلاقيني جنبك وقت إنكسارك ، تطمنّ ما أشبهك. | 1 | 9 | الكويت | False | AbrarMarzouq | 4 |
| 18 | 2015-10-03 18:54:30 | #نآقد_يوفنتيني (@MisterLippi) #Scudetto35 #LE6... | 35 | 240 | Turin, Piedmont | False | MrLippi_Q8 | 2082 |
| 19 | 2010-12-14 18:27:08 | اكاديمي كويتي متقاعد يهوي الضحك ويعشق سكيك الك... | 2361 | 1437 | | False | q8smily | 8773 |
| | created_at | description | favourites_count | followers_count | location | protected | screen_name | statuses_count |
|---|---|---|---|---|---|---|---|---|
| 0 | 2017-11-13 00:47:15 | | 0 | 0 | بيتي | False | Ketman_8lm | 4 |
| 1 | 2017-11-18 15:08:23 | | 0 | 0 | | False | zN0eAegNnhq6MbM | 0 |
| 2 | 2017-11-18 15:55:35 | | 0 | 0 | | False | tninhszizuhqd41 | 0 |
| 3 | 2017-11-12 20:13:57 | | 0 | 0 | | False | jvfTv4KIBgzZUla | 0 |
| 4 | 2017-11-18 15:38:16 | | 0 | 0 | | False | mRgZgj6rETxAP0u | 0 |
| 5 | 2017-11-18 15:22:06 | | 1 | 4 | | False | 8Dlb6cTzeNFtulj | 0 |
| 6 | 2012-06-14 05:37:50 | | 1996 | 3 | kuwait | True | Q8_lanko | 0 |
| 7 | 2017-11-16 18:33:53 | | 0 | 6 | الاحمدى, دولة الكويت | False | saidfaw79658353 | 0 |
| 8 | 2017-08-31 14:20:41 | | 0 | 3 | | False | Oussama35543613 | 0 |
| 9 | 2017-11-18 14:54:20 | | 0 | 0 | | False | sherlysimpson11 | 0 |
| | created_at | description | favourites_count | followers_count | location | protected | screen_name | statuses_count |
|---|---|---|---|---|---|---|---|---|
| 590 | 2017-11-12 00:20:19 | أشهد أن لا إله إلا الله وأشهد أن سيدنا محمد عب... | 1 | 12 | السالمية, دولة الكويت | False | Ibrahim74588329 | 0 |
| 591 | 2017-11-11 17:45:51 | الأخبار العامة أخبار العالم العلوم و التكنولوجيا | 101 | 10 | | False | laila_aleidan | 16 |
| 592 | 2011-11-22 22:37:05 | | 0 | 13 | | False | ibad80 | 1 |
| 593 | 2017-11-11 23:25:33 | | 0 | 1 | | False | Y69fP456EU8ND9V | 0 |
| 594 | 2017-11-11 22:34:57 | | 0 | 12 | Kuwait | False | moalsuwailem1 | 0 |
| 595 | 2017-10-08 21:51:35 | | 0 | 2 | | False | Tbmf3 | 0 |
| 596 | 2017-11-11 22:16:57 | My Gob | 1 | 2 | | False | MuzafforIslam1 | 4 |
| 597 | 2017-11-11 22:10:36 | | 0 | 0 | | False | gC9DV7GIahGn2QM | 0 |
| 598 | 2013-10-08 23:36:08 | #HackedByJM511 @T4TBHH | 2111 | 274 | | False | desinger_j84 | 380 |
| 599 | 2017-10-30 22:03:52 | Interior Designer , Artist , proud MOM 🕊 | 0 | 34 | Mishrif, Kuwait | False | AbrarMuhsen | 2 |
| | created_at | description | favourites_count | followers_count | location | protected | screen_name | statuses_count |
|---|---|---|---|---|---|---|---|---|
| 131 | 2017-10-12 20:27:56 | وقل اعملوا فسيرى الله عملكم | 168 | 31 | | False | 1BkicRwkk8s2go4 | 42 |
| 283 | 2017-11-15 07:15:39 | | 9 | 10 | | False | stapleford1984 | 0 |
| 264 | 2017-11-14 09:54:23 | | 21 | 11 | | False | x50jeAwF0urESzl | 38 |
| 235 | 2017-06-10 03:12:21 | | 166 | 14 | | False | mazen_zazo66 | 123 |
| 299 | 2017-11-14 22:45:36 | | 2 | 7 | | False | noor19816 | 28 |
| 525 | 2017-11-12 18:18:52 | | 1 | 25 | | False | ygjjgkihnjjb | 3 |
| 487 | 2017-10-13 02:56:52 | I sneak drinks into movie theatres, Meditation... | 0 | 7 | Windham, NH | False | JenjenSheeza | 2 |
| 191 | 2017-11-16 11:09:31 | اللهم لا تجعل ذكر أمي ينقطع وسخر لها الدعوات ط... | 0 | 41 | Kuwait | False | dalalii_32 | 6 |
| 560 | 2013-05-29 20:58:11 | ألحوآر مع آلجھلاء ' كآلرسم على ميآھ آلبحر ! مھ... | 0 | 280 | Kuwait | False | hoda7647675 | 850 |
| 459 | 2013-06-08 20:19:16 | لن نخضع والله خيرا حافظا | 0 | 164 | q8 | False | tarekelfahhed | 966 |
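The three previews above (index 0 to 9, index 590 to 599, and ten rows in no particular order) look like the first rows, the last rows, and a random draw from a 600-row DataFrame. The cells that produced them are not shown; a sketch of the usual pandas calls, run on made-up stand-in data, would be:

```python
import pandas as pd

# Made-up stand-in data: 600 rows, two of the columns seen in the tables.
df = pd.DataFrame({
    "screen_name": [f"user{i}" for i in range(600)],  # hypothetical values
    "followers_count": range(600),                     # hypothetical values
})

first_rows = df.head(10)                     # rows 0..9
last_rows = df.tail(10)                      # rows 590..599
random_rows = df.sample(10, random_state=0)  # 10 rows drawn without replacement

print(first_rows.index.tolist())
print(last_rows.index.tolist())
print(random_rows.index.tolist())
```

Looking at a random sample as well as the head and tail is a cheap check that the collected records are not all of one kind, for example all freshly created accounts.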
| | path | title |
|---|---|---|
| 0 | http://www.alanba.com.kw/ar/kuwait-news/offici... | بالفيديو.. الأمير بحث مع رئيس جزر القمر التعاو... |
| 1 | http://www.alanba.com.kw/ar/kuwait-news/823245... | توظيف 5500 مواطن منهم 2500 تربوي فوراً |
| 2 | http://www.alanba.com.kw/ar/kuwait-news/parlia... | بالفيديو.. «حقوق الإنسان»:\\nالعفو الأميري.. ال... |
| 3 | http://www.alanba.com.kw/ar/kuwait-news/educat... | «التربية»: لموظفي «البيئة»\\nحق دخول كل الأماكن... |
| 4 | http://www.alanba.com.kw/ar/kuwait-news/823262... | مجلس الأمة يصوّت بالموافقة على \"الديوان الوطني... |
| 5 | http://www.alanba.com.kw/ar/kuwait-news/823264... | \"الداخلية\" ترفع الحظر عن دخول عاملات المنازل ا... |
| 6 | http://www.alanba.com.kw/ar/kuwait-community/o... | بالفيديو.. دار الوفاء.. «الأنباء» تحتفي بكوكبة... |
| 7 | http://www.alanba.com.kw/ar/world-news/823283/... | فيديو منفر\\nلطفلة لبنانية تدخن..\\n\"بمباركة\" وا... |
| 8 | http://www.alanba.com.kw/ar/kuwait-news/823272... | \"الأنباء\" تنشر أسماء دفعة جديدة رشحها \"ديوان ا... |
| 9 | http://www.alanba.com.kw/ar/kuwait-news/823271... | الروضان يُحدد السقف الأعلى لاستقدام العمالة ال... |
| 10 | http://www.alanba.com.kw/ar/last/823279/04-04-... | بالفيديو.. امرأة في معطف من الفرو تأخذ صورة مع... |
| 11 | http://www.alanba.com.kw/ar/sport-news/world-s... | رسمياً \"فيفا\" يهدد اللاعب محمد صلاح بالاستبعاد... |
| 12 | http://www.alanba.com.kw/ar/world-news/823289/... | بالفيديو.. بلدية دبي تكشف\\nحقيقة الفيديو المتد... |
| 13 | http://www.alanba.com.kw/ar/art-news/arabic-in... | منشور يشعل النار\\nبين أحلام وإليسا.. والرد تغريدة |
| 14 | http://www.alanba.com.kw/ar/art-news/arabic-in... | بالفيديو.. محمد عبده يغنى\\nعلى الهواء احتفالا ... |
| 15 | http://www.alanba.com.kw/ar/art-news/arabic-in... | بالصور.. هذا هو سعر جاكيت نانسي عجرم في فيديو ... |
| 16 | http://www.alanba.com.kw/ar/last/823284/04-04-... | شاهد: البذخ ثم البذخ..\\nهكذا احتفلت عائلة كارد... |
| 17 | http://www.alanba.com.kw/ar/world-news/823287/... | مفاجأة ..رحيل روائي مصري\\nشهير تنبأ بموعد وتار... |
| 18 | http://www.alanba.com.kw/ar/art-news/823290/04... | بالفيديو والصور..هكذا احتفلت\\nأمل العوضي بميلا... |
| 19 | http://www.alanba.com.kw/ar/last/823285/04-04-... | صورة رضيع تجتاح مواقع التواصل الاجتماعي.. والس... |
| 20 | http://www.alanba.com.kw/ar/art-news/arabic-in... | بالفيديو.. نجمة عربية تعترف للمرة الاولى عن تع... |
| 21 | http://www.alanba.com.kw/ar/world-news/823280/... | فيديو لطالب يمني جامعي بالهند ضربوه وسحلوه بهج... |
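A `path` / `title` table like the one above is the typical result of web scraping: every headline link on a page becomes one row. The scraping cell is not shown, so the following is only a sketch of the idea. It uses the standard library's `HTMLParser` rather than whatever scraping library the notebook used, and the HTML snippet, the URLs, and the class name `LinkCollector` are all made up for illustration.

```python
from html.parser import HTMLParser
import pandas as pd

# Made-up HTML standing in for a downloaded news page.
html = """
<div class="story"><a href="http://example.com/news/1">First headline</a></div>
<div class="story"><a href="http://example.com/news/2">Second headline</a></div>
"""

class LinkCollector(HTMLParser):
    """Collects one {"path", "title"} record per <a href=...> element."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._href = None   # href of the link currently being read, if any
        self._text = []     # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            self.rows.append({"path": self._href, "title": title})
            self._href = None

parser = LinkCollector()
parser.feed(html)

# One row per link: the level of analysis here is the individual article.
df = pd.DataFrame(parser.rows, columns=["path", "title"])
print(df)
```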