If you are into deep learning and have always dreamed of having your own computer with a GPU to run your deep learning programs, I want to share the configuration I bought to build my own.
The whole configuration cost me $1650. Here it is.
INTEL I5 7700K CPU BOX
8 GB DDR4 VENGEANCE RAM (8 GB X 2) = 16 GB RAM
SSD 850 EVO 250GB
1 TB SATA HDD
GEFORCE GTX 1060 6GB
BITFENIX PRODIGY CABINET
22″ LED MONITOR WITH VGA DVI HDMI GW2270HM
650 W power supply COOLER MASTER
Cooler Master Hyper 212 X heat sink
HP COMBO C2500 DESKTOP J8F15AA
This configuration works well for me. I have used most of the available deep learning libraries in GPU mode, and they run really well.
If you are planning to buy a machine for deep learning, you should definitely consider this configuration. I have been using it for the last two months, and it has been a really good experience.
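If you build a similar machine, a quick way to confirm that the GPU is visible is to ask the NVIDIA driver tool. This is only a minimal sketch: it assumes the NVIDIA driver is installed and that its nvidia-smi command is on the PATH, and the function name list_nvidia_gpus is made up for illustration. It returns an empty list when the tool is missing, so it is safe to run on any machine.

```python
import shutil
import subprocess


def list_nvidia_gpus():
    """Return the GPU names reported by nvidia-smi, or [] if it is unavailable."""
    # Assumption: the NVIDIA driver ships the nvidia-smi command-line tool.
    exe = shutil.which("nvidia-smi")
    if exe is None:
        return []
    try:
        result = subprocess.run(
            [exe, "--query-gpu=name", "--format=csv,noheader"],
            capture_output=True,
            text=True,
            timeout=10,
        )
    except (OSError, subprocess.TimeoutExpired):
        return []
    if result.returncode != 0:
        return []
    # One GPU name per non-empty output line.
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


gpus = list_nvidia_gpus()
print(gpus if gpus else "No NVIDIA GPU detected")
```

Each deep learning library also has its own check for GPU support, so treat this as a first sanity test rather than a guarantee that a given library will use the card.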
Data science has created a lot of buzz over the past few years. Many questions come to mind when we hear the term data science: Why does this field create so much buzz? What kind of data is needed for data science? What are the important aspects of data science? What are its applications? What techniques are available to solve data science problems? How can anybody get into data science? Let's go through these questions one by one.
What is Data Science?
Let's go back to 1990, when the World Wide Web was just emerging. Over the last 25 years, people have gradually adopted this powerful invention and kept making it better.
Data volumes are exploding: more data has been created on the web in the past two years than in the entire previous history of the human race.
Nowadays the World Wide Web is a major source of data. People all over the world use the web every day, and this usage generates lots and lots of data. According to a report from EMC, in 2013 there were 4.4 zettabytes (ZB) of data on the web. This data includes historical as well as real-time data. Every day we interact with data, whether through social media, web search, news, blogs, videos, images, documents, etc.
Now we have more than enough data from which to extract knowledge. By analysing the data with proper scientific techniques, we can find hidden patterns or facts that help us answer questions that were previously unanswerable.
“This scientific way of analysing data or extracting knowledge out of data is called Data science.”
“Data science is all about making sense out of the data or extracting the knowledge from the data using data science techniques.”
What kind of Data is needed for Data science?
There are three kinds of data available on the web: structured, semi-structured, and unstructured.
Structured data is highly organised and is stored in tables.
A data model explicitly determines the structure of the data.
This kind of data has relational keys and is stored in relational databases.
Examples: student information database, employee information database, etc.
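As a minimal sketch of what such table-based data looks like in practice, the following Python snippet builds a tiny student table with the standard-library sqlite3 module. The table name, columns, and rows are made up for illustration.

```python
import sqlite3


def build_student_db():
    """Create an in-memory database with one fixed-schema table."""
    conn = sqlite3.connect(":memory:")
    # The schema (data model) fixes the structure before any data arrives.
    conn.execute(
        "CREATE TABLE student ("
        "id INTEGER PRIMARY KEY, "  # the key that identifies each row
        "name TEXT NOT NULL, "
        "department TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO student (id, name, department) VALUES (?, ?, ?)",
        [(1, "Asha", "Physics"), (2, "Ben", "Maths"), (3, "Chen", "Physics")],
    )
    conn.commit()
    return conn


def students_in(conn, department):
    """Return the names of all students in the given department."""
    rows = conn.execute(
        "SELECT name FROM student WHERE department = ? ORDER BY id",
        (department,),
    )
    return [name for (name,) in rows]


conn = build_student_db()
physics_students = students_in(conn, "Physics")
print(physics_students)
```

Because every row follows the same schema, a question like "who is in Physics?" is a single query with no guessing about where the department is stored.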
Semi-structured data has some structure, but it does not conform to the rigid tables of structured data.
It contains tags, other markers, or key-value pairs that separate semantic elements and enforce hierarchies of records and fields within the data. Therefore, it is also known as a self-describing structure.
Examples: XML, JSON, CSV
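To see what "self-describing" means, here is a small sketch in Python using the standard-library json module. The record and its field names are invented for illustration. Note that the two course entries do not carry the same fields, which a fixed table schema would not allow.

```python
import json

# A hypothetical student record: keys describe the data, nesting gives hierarchy.
raw = """
{
  "name": "Asha",
  "courses": [
    {"title": "Statistics", "credits": 4},
    {"title": "Machine Learning", "credits": 3, "lab": true}
  ]
}
"""

record = json.loads(raw)


def total_credits(student):
    """Sum the credits over all courses, treating a missing field as 0."""
    return sum(course.get("credits", 0) for course in student.get("courses", []))


def field_names(student):
    """Collect every field name that appears in any course entry."""
    names = set()
    for course in student.get("courses", []):
        names.update(course.keys())
    return sorted(names)


credits = total_credits(record)
fields = field_names(record)
print(credits, fields)
```

The code has to tolerate missing fields (hence the .get calls), which is the typical price of this flexibility.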
Most of the data we find on the web is unstructured, that is, it does not follow any structure.
This kind of data does not fit neatly into traditional relational databases.
Examples: satellite images, scientific data, photos, videos, radar data, mobile data, text, web content, social media, etc.
Mostly semi-structured and unstructured data sets are used for solving data science problems. Only a small set of applications uses structured data.
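With data like this, any structure has to be extracted by the analysis itself. As a rough sketch, the Python snippet below turns a piece of free text into word counts using only the standard library. The sample sentence and the simple tokenising rule are assumptions made for illustration; real text processing usually needs much more care.

```python
import re
from collections import Counter


def top_words(text, n=3):
    """Return the n most frequent words in a piece of free text."""
    # Naive tokenisation: lowercase the text and keep runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words).most_common(n)


post = "Data is everywhere. Data science turns raw data into knowledge."
counts = top_words(post, n=1)
print(counts)  # [('data', 3)]
```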
Why does data science field create a lot of buzz?
In the current era, we have a lot of data, cheap but efficient hardware, and tools and techniques that have emerged in the last few years to answer previously unanswerable questions. These are the factors that create the buzz around data science.
Aspects of the Data Science
Data science is an umbrella term; the field contains many other fields.
Data science includes statistics, programming, machine learning, natural language processing (NLP), text mining, visualisation, big data, data ingestion, data munging, and tools for data science.
Data science techniques
Data science techniques mainly include statistics, machine learning, and deep learning for solving problems such as speech recognition, image recognition, various NLP applications, etc.
Data science tool kit
Those who come from a technical background can use the following tools:
Scripting language for rapid prototyping (Scala or Python)
R – statistical programming tool
Deep learning libraries: TensorFlow, Torch, Deeplearning4j, etc.
Social media libraries
Basic Machine learning libraries
Those who come from a non-technical background can use the following tools:
Google Cloud Prediction API
Applications of Data science
Internet search – Ranking algorithms
Digital advertisement – Statistical techniques heavily used
Recommender systems – Machine learning techniques mostly used
Image recognition – Deep Neural Network / Deep and wide Neural Network
Speech recognition – Deep Neural Network / Deep and wide Neural Network / Linguistics techniques
Gaming – Machine learning / Deep Neural Network / Deep and wide Neural Network
Credit risk modelling – Statistics and Machine learning
Fraud detection – Statistics, Machine learning and graph theory
Social media intelligence – NLP, Sentiment analysis, Influence detection, etc.
Intelligent chat bots – Statistics, Machine learning, NLP and deep learning
Self-driving car – Rule-based system
Robots – under research
Starting with the next post, I am going to begin a tutorial series for data science beginners.
This tutorial series includes:
Ubuntu for beginners
Tool kit list and installation guide
Regular expression guide
Scraping of the data
Data cleaning / pre-processing
Basics of statistics
Basics of Machine learning techniques
Apply machine learning techniques on pre-processed data