2018-06-21 打开数据科学大门的钥匙

R的书一看似乎就会了。就是记不住函数和命令,遇到问题,还需要翻书和文档。

初学者的通病吗?😳

此文说的好“If you don’t practice what you learn, you will forget.”

——学了要用才能记住。


A KEY FOR MASTERING DATA SCIENCE

by Sharp Sight | Jun 20, 2018

A few days ago, I received an email from a Sharp Sight reader. The author of the email is having trouble learning data science in R. He’s taken several data science courses, but still has trouble with critical data science skills.

Here’s an excerpt from his email:

I’ve redacted the company names, but I’ll tell you that he took courses by two very well known data science training companies.

Does this sound familiar?

What Miguel is experiencing is unfortunate, but all too common.

I regularly receive emails from people telling me that they’ve taken courses about data science, but still can’t write code. They initially learned the material, but eventually forgot everything.

This is a critical problem on the path to mastering data science. If you don’t practice what you learn, you will forget.

WHY YOU’RE FAILING TO MASTER DATA SCIENCE SKILLS

The critical problem for students like this isn’t necessarily learning.

Many data science skills are easy enough to understand. dplyr is fairly easy to understand. Although ggplot2 has a few quirks, it is also relatively easy to understand. You can watch a few videos and learn how these tools work. It’s pretty easy to type in some code a few times and get these tools to work on your own computer. These tools are easy to learn.

Moreover, there’s plenty of material out there to help you learn. Although the quality definitely varies, you can find information about almost anything on the internet. Even here at Sharp Sight, we give away tons of free tutorials to help you learn data science.

There’s plenty of tutorials to help you learn, and many techniques (especially the “basics) are easy enough to learn.

But they aren’t necessarily easy to remember. That’s because the human mind naturally forgets.

Have you ever had the experience of asking some their name, only to forget it 5 minutes later? Have you ever learned a fact from a book, but then forgotten it the next day?

This is natural. It’s completely normal. The human brain naturally forgets.

So on the path to mastering data science skills, the roadblock typically isn’t learning so much as the problem is forgetting.

Forgetting what you learn is stopping you from mastering data science.

YOU NEED TO MASTER DATA SCIENCE, NOT JUST “LEARN” IT

At this point, I want to re-emphasize the importance of mastery.

Here at Sharp Sight, we routinely stress the importance of mastery. You need to master data science skills. It’s not enough just to “sort of know them.”

This is particularly true of basic syntax. Like it or not, R syntax (or the syntax of another data science language like Python) gives you the tools to “get things done.” You need syntax to create deliverables and ultimately create business value for your clients.

If you get hired, it’s not good enough to just have a vague memory of the syntax. It’s not good enough to be a cut-and-paste coder who goes to Google every 5 minutes to look up how to write a simple piece of syntax.

To work as a data scientist, you need to be relatively “fluent” in writing data science syntax. If you work in R – like we often do here at Sharp Sight – that means that you need to be relatively fluent in writing R code. You need to have a strong working command of the syntax. This means that you should be able to write the code for basic tools rapidly and from memory.

Mastery isn’t only important for “getting things done.” Mastery of basic syntax is also important for getting hired. To get hired as a data scientist, you need to be somewhat fluent. At the very least, you’ll need to be fluent in the “core” skills.

What do you think will happen if you walk into a data science interview and someone asks you to write the code for a scatterplot …

Do you think that it’s enough to tell them, “yeah, I watched a video on that once and typed the code a few times, but I don’t really remember how to do it”?

Uh … you won’t get the job.

To get a job and to be effective in a job, you will need to master “core” data science skills. You will need to have a strong fluency in the syntax for data manipulation, data visualization, and data analysis.

If you want a data science job, and perform well in that job, mastering the skill set is critical.

TO MASTER DATA SCIENCE, YOU NEED TO STOP FORGETTING

To achieve this level of mastery – to remember the syntax and write code from memory – you need to stop forgetting.

Ask yourself, how much time have you spent trying to learn and master data science? Do you remember what you’ve learned?

My guess is that out of all of the syntax you’ve “learned” by reading a blog post or cutting-and-pasting code, you remember only a small fraction.

And I bet it’s frustrating. You’re wasting time! If you weren’t forgetting so much of what you learn, you’d probably have mastered R and Python by now!

This is the problem.

And to be clear, I’m not trying to berate you. We all forget. This is just how the human mind works. But like it or not, it’s a problem.

To master data science, you need to stop forgetting what you learn.

REPETITION IS THE KEY TO MEMORIZING SYNTAX

I’ll simply tell you the secret to stop forgetting.

It’s the secret to remembering what you’ve learned. It’s the secret to mastering data science syntax, or any skill for that matter.

The secret to remembering what you learn is repetition.

Repetition is the key. You need to repeat your practice.

TOP PERFORMERS REPEAT THEIR PRACTICE ACTIVITIES

If you look at top performers of all kinds, they repeat their practice activities. Top performers know that the secret to mastery is repetition. Relentless repetition. Repetition until the skill becomes second nature. Repetition until you’ve achieved “unconscious competence.” Then a little more repetition, just to be sure.

ATHLETES REPEAT THEIR PRACTICE

In his book Relentless, elite basketball trainer Tim Grover – the famous trainer of Kobe Bryant and Michael Jordan – explained that guys like Michael and Kobe practiced basketball shots over and over again. He notes that on some days, an elite player might practice a single shot thousands of times in a single day. Thousands of repetitions. Michael Jordan wasn’t born the best, he earned it with millions of repetitions over years of training.

MUSICIANS REPEAT THEIR PRACTICE

We see something similar among musicians. Elite violinists and musicians are known to practice a single “phrase” of a song over and over. At first, they might just practice to learn and memorize the phrase. At first, they’re just trying to execute the musical phrase correctly. But as they move on, they try to refine it and add a bit of finesse. For musicians, the process of memorization and refinement comes from careful repetition.

From personal experience, I can also tell you that learning a musical instrument is strongly dependent on repetition. You have to repeat scales and techniques until you can execute them fluidly. You repeat them until they are second nature. Eventually, you repeat them until you never forget them (I can still play songs that I learned years ago).

NAVY SEALS REPEAT THEIR PRACTICE

Navy SEALs – who are renowned for their ability to succeed in tough battle conditions – are also relentless trainers. They succeed because they prepare. SEALs make a point to train their skills until they have them completely memorized. Then they train some more.

Here’s a quote by retired Navy SEAL, Rob Roy:

“Repetition leads to memorization and memorization leads to instinct. Therefore, one must train and train their skills until they know a procedure cold. And then they must train some more.”

I love this quote. It’s nominally about training combat techniques, but it applies to almost any skill. The secret to becoming great at anything is repetition. It doesn’t matter if you want to be a soldier or a data scientist.

If you want to walk into a data science interview and astound them with how well you know the code, you need to practice syntax. You need to repeat the syntax you learn. If you want to be fluent in writing data science code, and if you eventually want to be one of the best, you need repetition.

TO MASTER DATA SCIENCE, YOU NEED TO REPEAT YOUR PRACTICE

To be a great data scientist, you need to have some core syntax memorized. You need to essentially memorize the code to perform basic techniques: data manipulation, data visualization, data analysis.

Will you occasionally forget a few things. Yes. Sometimes. But there are some things that you should know cold. For example, you should be able to write the code to filter a dataset without hesitation. You should be able to write the code to create a scatterplot without flinching. You should have these memorized.

Being a great data scientist means that you need to have some things memorized. That’s what it takes to get things done.

To master data science, you need to repeat your practice. CLICK TO TWEET

To memorize the syntax, you need to repeat it. Over and over. And then train a bit more …

REPETITION REQUIRES A PRACTICE SYSTEM

Repeating your practice activities requires you to be systematic. You can’t be haphazard in how you train. If you don’t train in a systematic way, you’re greatly increasing your chances of failure.

This shouldn’t surprise you. Many skills have practice drills that are used to help people learn and master that skill.

For example, musicians often have drills to practice scales, musical phrases, chords, and arpeggios.

Basketball players have passing drills or drills to practice a particular shot over and over.

The best performers are systematic in how they train.

If you want to be great data scientist, you need to be systematic as well.

The problem here is that almost no one trains this way. As I’ve noted before, many new data science students use the “jump in and build something” method of data science training. They just jump in and start a big project. This would be like trying to learn basketball by just playing a few games every now and again. Yes, you can learn a little, but you’ll never be as good as someone who drills and trains the details to mastery.

To be clear, data science projects can be good later in your data science training. Projects are a great way to integrate what you’ve learned, after you’ve mastered the basics. But projects are completely inefficient for mastering basic syntax. The reason is that when you work on a project, there’s not enough repetition. In a project, there are many pieces of that you will only use once or twice. That’s not enough repetition to memorize the code snippet.

No. Projects aren’t the best way to master data science because they don’t give you enough repetition. You need a structured practice system.

SIGN UP FOR OUR EMAIL LIST NOW

In our paid training courses at Sharp Sight, we will show you just such a practice system. Our courses teach you a specialized training methodology for repeating data science syntax until you have it memorized.

Many of our students report becoming “fluent” in writing data science code. They write code from memory.

If you want to learn how to practice data science … how to repeat your data science practice so you memorize the syntax, sign up for our email newsletter and enroll in our course when it opens.

Our training courses only open up a few times per year, and the enrollment details are sent exclusively to people on our email list.

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 156,757评论 4 359
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 66,478评论 1 289
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 106,540评论 0 237
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 43,593评论 0 203
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 51,903评论 3 285
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 40,329评论 1 210
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 31,659评论 2 309
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 30,383评论 0 195
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 34,055评论 1 238
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 30,337评论 2 241
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 31,864评论 1 256
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 28,227评论 2 251
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 32,820评论 3 231
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 25,999评论 0 8
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 26,750评论 0 192
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 35,365评论 2 269
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 35,260评论 2 258

推荐阅读更多精彩内容

  • rljs by sennchi Timeline of History Part One The Cognitiv...
    sennchi阅读 7,096评论 0 10
  • 越接近下班,心情就越沮丧。今天看到一句话,想变得跟好却总是付出得太少。我应该就是这样吧。我不应该用猜测的语气,我...
    古月七七阅读 219评论 0 1
  • “我忘不了他。 他要走了,我是不是应该打电话给他” 告诉他你爱他还是祈求继续和他在一起?姑娘挺起你的腰板,踮踮脚跟...
    筠溪阅读 269评论 0 0
  • 我爱超越家族,深深的爱。 人生得意须尽欢,莫使金樽空对月。 Merry Christmas !
    美喵先丽阅读 87评论 0 1