Managing Your Data
What is a database?
Almost every data scientist will spend time working in a database, which is an organized collection of structured data in a computer system. (Remember, structured data is usually organized in a table format with rows and columns, like the following example.)
Last four digits of social security number | Last name | Age |
---|---|---|
6881 | Marshall | 23 |
0121 | Rodriguez | 19 |
5538 | Cho | 59 |
2972 | Parker | 33 |
3154 | Sawyer | 72 |
Most databases today are organized as relational databases, which are collections of multiple data sets or tables that link together.
While SQL is the underlying language that drives most work done in relational databases, there are many RDBMSs in which you can do that work. As you venture into this field, you’ll run into names like these:
- MySQL
- Microsoft Access
- PostgreSQL
- Oracle
- IBM DB2
- MongoDB
Choose the right tools to manage data
Where do you begin? There are dozens of useful data science tools and platforms! Here’s a list of some popular and open source platforms that you can use to begin your own data science journey.
No comments:
Post a Comment