当前位置：网站首页>斯坦福CV课程学习笔记（一）

斯坦福CV课程学习笔记（一）

2022-07-21 10:49:00 【dantamiao】

视频源自

作为一个设计学出身企图踏足人工智能领域的小白，听说这门课很OK于是来听课了。希望给非科班出身的同学们一些帮助。

以下是Lecture2 image classification的笔记

Nearest neighbor classifier

在本次课程中将会使用python3和numpy, 课程附上了简单的入门教学，地址如下：

http://cs231n.github.io/python-numpy-tutorial/

曼哈顿距离-Manhuttan distance L1

(30条消息) 机器学习中的数学——距离定义（二）：曼哈顿距离（Manhattan Distance）_von Neumann的博客-CSDN博客_机器学习曼哈顿距离https://blog.csdn.net/hy592070616/article/details/121569933?spm=1001.2014.3001.5501这篇文章介绍的非常详细，除了曼哈顿距离还有常用的欧几里得和余弦距离。

总体来说就是两个图像每个块的值相减，再加在一起，456就是两个图像的差了。

nearest neighbor clssifier的问题就在于，测试所用计算时间太长。而卷积神经网络训练时间长，但是测试非常快。

K-Nearest neighbors

K越大边界越平滑，大部分情况我们使用大于1的K值。白色区域中没有预测值。

欧几里得距离-Euclidean distance L2

图片相似性计算中用欧几里得的也比较多。那如何选择不同的方法？

视频原话：

If your input features, if the individual entries in your vector have some important meaning for you ask, then maybe somehow L1 might be a more natural fit. But if it's just a generic vector in some space and you don't know which of the different elements, you don't know what they actually mean, then maybe L2 is slightly more natural.
如果您的输入特征，如果您的向量中的各个条目对您有重要意义，那么也许 L1 可能更自然。但是如果它只是某个空间中的一个通用向量并且你不知道哪些不同的元素，你不知道它们的实际含义，那么 L2 可能会稍微自然一些。