Deep Learning

Introduction

The difficulties faced by systems relying on hard-coded knowledge suggestthat AI systems need the ability to acquire their own knowledge, by extracting patterns from raw data. This capability is known asmachine learning.

从原始数据中抽取模式,获取新知识的研究,被称作机器学习(Machine Learning)。

The performance of these simple machine learning algorithms depends heavilyon therepresentationof the data they are given. 
For example, when logistic regression is used to recommend cesarean delivery, the AI system does not examine the patient directly. 
Instead, the doctor tells the system several pieces of relevantinformation, such as the presence or absence of a uterine scar. 
Each piece ofinformation included in the representation of the patient is known as afeature.
Logistic regression learns how each of these features of the patient correlates withvarious outcomes. 
However, it cannot influence how features are defined in anyway. 
If logistic regression were given an MRI scan of the patient, rather thanthe doctor’s formalized report, it would not be able to make useful predictions.
Individual pixels in an MRI scan have negligible correlation with any complicationsthat might occur during delivery.

这段就讲的很好,logistic regression 严重依赖于从原始中获得的特征的质量。

读完这一段,突然有些明白了 AI 领域里面的层次关系:Knowledge base 尽量写好预先的规则;Machine learning 从拟定好的特征数据里面,学习一些决策规则;Representation learning 从原始数据里面学习特征?;Deep learning 就从原始数据里面学习特征,到最后的决策都有了。

ai-region

合理地处理数据,生成用于描述数据的特征,对机器学习来说十分重要。通过图 coordinate-matters 可以看出同样的问题,在不同的体系内,处理的难度是不一样的。

Representation learning

One solution to this problem is to use machine learning to discover not only the mapping from representation to output but also the representation itself.
This approach is known as representation learning.

为了解决人工构造特征的困境,就引入了 representation learning。这一研究领域不仅想学习出特征到输出结果的模型,还要学习出从原始数据到特征的模型。

两点好处

Learned representationsoften result in much better performance than can be obtained with hand-designedrepresentations. They also enable AI systems to rapidly adapt to new tasks, withminimal human intervention.
  1. 通常通过 representation learning 学习到的特征,都要比人工设计的特征要好。
  2. 只需要少量人工干预,就可以快速适应新任务。

设计特征的目标

When designing features or algorithms for learning features, our goal is usuallyto separate thefactors of variationthat explain the observed data.

设计特征时,我们的主要目标是获得变量的主要因素(the factors of variation)。这个因素可以等价理解为矩阵的秩。

This solution isto allow computers to learn from experience and understand the world in terms ofa hierarchy of concepts, with each concept defined through its relation to simplerconcepts. By gathering knowledge from experience, this approach avoids the needfor human operators to formally specify all the knowledge that the computer needs.
The hierarchy of concepts enables the computer to learn complicated concepts bybuilding them out of simpler ones. 
If we draw a graph showing how these concepts built on top of each other, the graph is deep, with many layers. For this reason,we call this approach to AI deep learning.

深度学习:通过经验数据,学习出一系列层次概念。每个概念又通过这个概念与一些简单的概念之间的关系进行定义。通过对经验数据的学习,避免了人工定义计算机所需的所有知识。这种架构使计算机可以在简单概念的基础上学习出复杂的概念。如果我们要用图来表示这些概念之间的关系,我们将得到一个非常深的、具有很多层的图。因此,我们称这种方法为深度学习(deep learning)。

hidden layers

Then a series ofhidden layersextracts increasingly abstractfeatures from the image. 
These layers are called “hidden” because their values are not givenin the data; instead the model must determine which concepts are useful for explainingthe relationships in the observed data.

隐藏层只所以叫隐藏层,是因为这些层的数据并不是直接来源于原始数据。 取而代之的是,由上层网络决定哪些从数据里面学习到的概念对解释数据是有效的。

However, deep learning can be safely regarded as thestudy of models that involve a greater amount of composition of either learnedfunctions or learned concepts than traditional machine learning does.

与传统的机器学习相比,深度学习可以被看作是对一系列组合模型的研究。 这些组合模型,是由学习出的函数或者概念组成的。

results matching ""

    No results matching ""