展示广告点击率预估方法研究
更新时间:2023-04-06 06:11:01 阅读量: 教育文库 文档下载
- 对某广告的点击率进行预估推荐度:
- 相关推荐
Abstract
With the rapid development of the Internet, online advertising acts a pivotal part in the Internet in our daily life and it has become the most popular approach to do brand promotion and product marketing for the advertiser. Accurate click-through rate (CTR) prediction is the most important part of online advertising. Improving the accuracy of the ads' CTR estimation can not only benefit to advertisers, but also improve user experience.
Many traditional click through rate prediction methods, such as logistic regression, have been applied to advertising click rate prediction system and achieved good results. Furthermore, it has been large-scale deployed in the industry. Recently, the deep learning technology has achieved great success in multiply fields of Natural Language Processing and Computer Vision, such as Textual Entailment, Text Summarization, Image Generation and so on. Meanwhile, a number of deep learning models right now have been used in personalized recommender system and CTR prediction and their model structures are similar. Both of them reduce the dimension of the feature by vectorization, then utilize nonlinear operation to extract the feature combination, and calculate nonlinear relationship between the features and the click rate through by neural network. The content of this paper of the following three main aspects:
(1) Ensemble Learning by multiple traditional machine learning models based CTR model. We first do feature engineering on two large-scale real-world display advertising datasets manually and extract high-order combination feature by GBDT. Then we calculate CTR by mature machine learning models such as logistic regression and factorization machine. Then we utilize ensemble learning base on multiple single models. Finally, we calculate the result of ensemble learning method.
(2) Advance deep learning model based CTR model. We use deep neural network and recurrent neural network to do click-through rate prediction. We try to combine the features extracted from feature engineering and get the input of deep neural network through feature hashing and feature connection. Finally, we calculate the result of advance deep learning model.
(3) Multi-Embedding deep model based CTR model. We propose a novel CTR predicting model, Multi-Embedding Deep Model. We implement deep neural network based and convolutional neural network based traditional multi-embedding deep model, and also implement deep neural network based and convolutional neural network based bilinear multi-embedding deep model. which
- II -
we utilize bilinear matrix to do feature interactions instead of factorization machines. We design a system to address the cold-start problem for static data set by combining clustering method and marking rare embedding vectors method. We evaluate the proposed model on IPinYou and Avazu datasets, two large-scale real-world display advertising datasets. Experimental results show that the model can improve the estimation performance of ads' click-through rate effectively.
Keywords:online advertising, click-through rate, deep learning, convolutional neural network, bilinear
- III -
目录
摘要 .......................................................................................................................... I ABSTRACT ................................................................................................................ II 第1章绪论 .. (1)
1.1课题的来源及研究的目的和意义 (1)
1.1.1 课题的来源 (1)
1.1.2 课题的研究目的和意义 (1)
1.2国内外研究现状 (2)
1.2.1 基于机器学习的点击率预估模型研究现状 (2)
1.2.2 基于深度学习的点击率预估模型研究现状 (4)
1.3数据集与问题定义 (5)
1.3.1 数据集描述 (5)
1.3.2 点击率预估的问题定义 (9)
1.3.3 点击率预估的评价指标 (9)
1.3.4 基线系统选择 (11)
1.4本文的主要研究内容 (13)
1.5本文内容安排 (14)
第2章基于模型融合的点击率预估研究 (15)
2.1引言 (15)
2.2单模型点击率预估 (15)
2.2.1 GBDT高阶特征组合模型 (15)
2.2.2 FM点击率预估模型 (18)
2.2.3 FFM点击率预估模型 (19)
2.3集成学习点击率预估 (20)
2.3.1 强模型融合 (20)
2.3.2 机器学习元算法 (21)
2.4基于模型融合的点击率预估模型 (23)
2.5实验结果与分析 (24)
2.5.1 模型参数设置 (24)
2.5.2 实验结果对比分析 (25)
2.6本章小结 (27)
- IV -
第3章基于深度学习的点击率预估研究 (29)
3.1引言 (29)
3.2基于传统深度模型的点击率预估研究 (29)
3.2.1 激活函数 (29)
3.2.2 Dropout (30)
3.2.3 Batch Normalization (31)
3.2.4 反向传播算法 (33)
3.2.5 基于传统深度神经网络的点击率预估模型 (34)
3.3基于循环神经网络的点击率预估研究 (34)
3.3.1 循环神经网络 (35)
3.3.2 长短期记忆网络 (36)
3.3.3 门控循环单元 (37)
3.3.4 双向循环神经网络 (38)
3.3.5 基于时间的反向传播算法 (39)
3.3.6 基于循环神经网络的点击率预估模型 (40)
3.4浅层特征与深层特征结合的点击率预估模型 (41)
3.5实验结果与分析 (41)
3.5.1 模型参数设置 (42)
3.5.2 实验结果对比分析 (42)
3.6本章小结 (46)
第4章基于MULTI-EMBEDDING的点击率预估研究 (48)
4.1引言 (48)
4.2卷积神经网络相关技术研究 (48)
4.2.1 卷积层 (48)
4.2.2 池化层 (50)
4.3双线性特征组合 (51)
4.4冷启动问题模型 (52)
4.5基于传统M ULTI-E MBEDDING的点击率预估模型 (53)
4.5.1 基于深度神经网络的传统Multi-Embedding点击率预估模型 (53)
4.5.2 基于卷积神经网络的传统Multi-Embedding点击率预估模型 (54)
4.6基于双线性M ULTI-E MBEDDING的点击率预估模型 (55)
4.6.1 基于深度神经网络的双线性Multi-Embedding点击率预估模型 (55)
4.6.2 基于卷积神经网络的双线性Multi-Embedding点击率预估模型 (56)
4.7实验结果与分析 (57)
- V -
4.7.1 模型参数设置 (57)
4.7.2 实验结果对比分析 (57)
4.8本章小结 (61)
结论 (62)
参考文献 (65)
攻读硕士学位期间发表的论文及其它成果 (72)
哈尔滨工业大学学位论文原创性声明和使用权限 (73)
致谢 (74)
- VI -
正在阅读:
展示广告点击率预估方法研究04-06
网上卖药短距物流缩短最后一公里06-03
英语介绍公司各个部门职责07-23
建筑施工机械设备系统安装操作与生产基地安全作业质量技术标准及04-10
4-3《多彩服饰》教案06-14
《单片机原理及应用》期末复习题103-11
厨师岗位练兵试题02-28
数字电子技术基础 第一章01-19
- exercise2
- 铅锌矿详查地质设计 - 图文
- 厨余垃圾、餐厨垃圾堆肥系统设计方案
- 陈明珠开题报告
- 化工原理精选例题
- 政府形象宣传册营销案例
- 小学一至三年级语文阅读专项练习题
- 2014.民诉 期末考试 复习题
- 巅峰智业 - 做好顶层设计对建设城市的重要意义
- (三起)冀教版三年级英语上册Unit4 Lesson24练习题及答案
- 2017年实心轮胎现状及发展趋势分析(目录)
- 基于GIS的农用地定级技术研究定稿
- 2017-2022年中国医疗保健市场调查与市场前景预测报告(目录) - 图文
- 作业
- OFDM技术仿真(MATLAB代码) - 图文
- Android工程师笔试题及答案
- 生命密码联合密码
- 空间地上权若干法律问题探究
- 江苏学业水平测试《机械基础》模拟试题
- 选课走班实施方案
- 方法研究
- 预估
- 点击率
- 展示
- 广告
- 「优质」2022-2022学年度第一学期五年级语文上册第四单元
- 安保工程施工组织设计
- 【新整理】 七年级英语下册Unit1Canyouplaytheguitar短语语法知
- 水稳基层监理细则水稳控制要点
- 汽车成型顶棚项目可行性研究报告评审方案设计(2013年发改委标准
- (精选)群体性事件处置案例分析和启示
- 四川近九成城乡居民对当前生活表示满意
- 2016最新新目标(Go for it)版九年级英语第1次月考
- 2013新版自考英语(二)讲义完整版
- 档案信息化基础理论与实践-测试
- “两学一做”学习教育常态化制度化应知应会题
- 四年级信息技术上册第一课《信息与信息技术》《认识信息技术》教
- ProKey编程交流论坛正式开放注册-贴吧宣传文档第一稿
- 2022-2022年初中物理人教版《八年级下》《第七章 力》精选专题试
- 《马克思主义基本原理概论》期末考试重点知识
- 人教B数学必修四课时分层作业 用平面向量坐标表示向量共线条件
- 公共机构绿色食堂评价导则.pdf
- 中考中数学如何取得好成绩
- 检验科停电应急预案通用版
- 限位板模具设计说明书模板