当前位置:网站首页>Watermelon book chapter 2 Notes - Performance Measurement
Watermelon book chapter 2 Notes - Performance Measurement
2022-07-21 22:41:00 【kurok_】
This article only discusses the measurement methods of prediction tasks .
Return to the task
The mean square error is often used in regression tasks :
A more general , For data distribution D And the probability density function p(), The mean square error can be expressed as
Classification task
1. Error rate and accuracy
Error rate : The proportion of the total number of samples in the sample station with wrong classification
precision :1- Error rate
There is also a form of probability distribution , Similar to the return mission .
2. Precision rate 、 Recall rate 、F1
Real examples (TP)、 False positive example (FP)、 True counter example (TN)、 False counter example (FN)
Precision rate P=TP/(TP+FP), In the positive example of judgment , It's really the proportion of positive examples
Recall rate R=TP/(TP+FN), In all positive examples , The proportion judged to be true
P-R curve , Can roughly judge which model is good , Generally, if A All inclusive B shows A good , But it's rare , You can compare the area , It can also be compared Balance point (BEP), namely P=R The value of time , The bigger the better .
however BEP It's still too simplified , What is commonly used is F1,F1 yes P and R Of Harmonic mean
In some cases, more attention may be paid to precision or recall . You can give F1 Assign different weights , as follows :
\beta Greater than 1 When the recall rate is high , Less than 1 The hourly accuracy rate accounts for a high proportion .
in addition :macroF1 Indicators like that , See watermelon book P32.
3.ROC And AUC
True case rate TPR=TP/(TP+FN), In all positive examples , The proportion judged
The false positive rate is FPR=FP/(TN+FP), Among the samples with wrong judgment , The proportion of positive cases
Choose different decision thresholds , You can draw multiple TPR and FPR
With TPR and FPR The graph drawn for the coordinate axis is ROC diagram ,ROC The area of the curvilinear siege is AUC.
AUC The meaning of ? In form , Consider the sample prediction is the sorting quality .
From the definition of sequencing loss (P35), If the prediction of the positive example is less than that of the negative example , Then write down a penalty , If equal, then remember 0.5 Penalty points .
AUC=1- Sort loss( Understand graphically )
4. Cost sensitive
Add the probability term corresponding to the cost matrix and the evaluation standard .
边栏推荐
- 148. Sorting linked list
- 19. 删除链表的倒数第 N 个结点
- One article explains the problem of data fragmentation in distributed systems
- 01 knapsack interview questions series (I)
- 「跑象科技」获得天使+融资,打造新一代实时数据基础平台
- 844. Compare strings with backspace
- 3级学业水平测试
- 堆-原理到应用——堆排序、优先级队列
- Datart data visualization works are open source | chart plug-in works are all open source, which can be extracted through Baidu cloud download link
- 62. Different paths
猜你喜欢
归并排序
企业如何做好数据管理?产品选型怎么做?
226. 翻转二叉树
1046. Weight of the last stone
One article explains the problem of data fragmentation in distributed systems
Further learning of 02 selenium (control browser window +)
beta.4 版发布啦,国产开源数据可视化 datart 接下来将会小跑进入 rc 阶段
205. Isomorphic string
567. 字符串的排列
438. Find all letter ectopic words in the string
随机推荐
组队
Do you think sub database and sub table are really suitable for your system? Talk about how to select sub databases, sub tables and newsql
005: storage space size of integer data type
子序列
328. Parity linked list
Interviewer: have you made a judgment on the number of rows affected by your update?
What are the characteristics and application fields of Bluetooth module in the Internet of things?
06 page object + pytest unit test framework
567. Arrangement of strings
VisualStudio2019 配置点云库 PCL1.11.1+斯坦福兔子测试
2020普及组总结
Chat about matter protocol (original chip protocol)
组合学总结
归并排序
In depth analysis of multiple knapsack problem (Part 2)
"Running image technology" obtained Angel + financing to build a new generation of real-time data infrastructure platform
2022ACM夏季集训周报(三)
干货 | 分布式系统的数据分片难题
D - AND and SUM (AtCoder Beginner Contest 238)
堆-原理到应用——堆排序、优先级队列