You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
| Deep Session Interest Network |[IJCAI 2019][Deep Session Interest Network for Click-Through Rate Prediction ](https://arxiv.org/abs/1905.06482)|
55
55
| FiBiNET |[RecSys 2019][FiBiNET: Combining Feature Importance and Bilinear feature Interaction for Click-Through Rate Prediction](https://arxiv.org/pdf/1905.09433.pdf)|
56
56
| FLEN |[arxiv 2019][FLEN: Leveraging Field for Scalable CTR Prediction](https://arxiv.org/pdf/1911.04690.pdf)|
57
+
| BST |[DLP-KDD 2019][Behavior sequence transformer for e-commerce recommendation in Alibaba](https://arxiv.org/pdf/1905.06874.pdf)|
57
58
| DCN V2 |[arxiv 2020][DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems](https://arxiv.org/abs/2008.13535)|
58
59
59
60
## Citation
@@ -82,3 +83,38 @@ If you find this code useful in your research, please cite it using the followin
82
83
83
84

84
85
86
+
87
+
## Main contributors([welcome to join us!](./CONTRIBUTING.md))
Copy file name to clipboardexpand all lines: deepctr/estimator/models/afm.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
"""
3
3
4
4
Author:
5
-
Weichen Shen, wcshen1994@163.com
5
+
Weichen Shen, weichenswc@163.com
6
6
7
7
Reference:
8
8
[1] Xiao J, Ye H, He X, et al. Attentional factorization machines: Learning the weight of feature interactions via attention networks[J]. arXiv preprint arXiv:1708.04617, 2017.
Copy file name to clipboardexpand all lines: deepctr/estimator/models/ccpm.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
"""
3
3
4
4
Author:
5
-
Weichen Shen, wcshen1994@163.com
5
+
Weichen Shen, weichenswc@163.com
6
6
7
7
Reference:
8
8
[1] Liu Q, Yu F, Wu S, et al. A convolutional click prediction model[C]//Proceedings of the 24th ACM International on Conference on Information and Knowledge Management. ACM, 2015: 1743-1746.
Copy file name to clipboardexpand all lines: deepctr/estimator/models/dcn.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# -*- coding:utf-8 -*-
2
2
"""
3
3
Author:
4
-
Weichen Shen, wcshen1994@163.com
4
+
Weichen Shen, weichenswc@163.com
5
5
6
6
Reference:
7
7
[1] Wang R, Fu B, Fu G, et al. Deep & cross network for ad click predictions[C]//Proceedings of the ADKDD'17. ACM, 2017: 12. (https://arxiv.org/abs/1708.05123)
Copy file name to clipboardexpand all lines: deepctr/estimator/models/deepfm.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# -*- coding:utf-8 -*-
2
2
"""
3
3
Author:
4
-
Weichen Shen, wcshen1994@163.com
4
+
Weichen Shen, weichenswc@163.com
5
5
6
6
Reference:
7
7
[1] Guo H, Tang R, Ye Y, et al. Deepfm: a factorization-machine based neural network for ctr prediction[J]. arXiv preprint arXiv:1703.04247, 2017.(https://arxiv.org/abs/1703.04247)
Copy file name to clipboardexpand all lines: deepctr/estimator/models/fnn.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# -*- coding:utf-8 -*-
2
2
"""
3
3
Author:
4
-
Weichen Shen, wcshen1994@163.com
4
+
Weichen Shen, weichenswc@163.com
5
5
6
6
Reference:
7
7
[1] Zhang W, Du T, Wang J. Deep learning over multi-field categorical data[C]//European conference on information retrieval. Springer, Cham, 2016: 45-57.(https://arxiv.org/pdf/1601.02376.pdf)
Copy file name to clipboardexpand all lines: deepctr/estimator/models/nfm.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# -*- coding:utf-8 -*-
2
2
"""
3
3
Author:
4
-
Weichen Shen, wcshen1994@163.com
4
+
Weichen Shen, weichenswc@163.com
5
5
6
6
Reference:
7
7
[1] He X, Chua T S. Neural factorization machines for sparse predictive analytics[C]//Proceedings of the 40th International ACM SIGIR conference on Research and Development in Information Retrieval. ACM, 2017: 355-364. (https://arxiv.org/abs/1708.05027)
Copy file name to clipboardexpand all lines: deepctr/estimator/models/pnn.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# -*- coding:utf-8 -*-
2
2
"""
3
3
Author:
4
-
Weichen Shen, wcshen1994@163.com
4
+
Weichen Shen, weichenswc@163.com
5
5
6
6
Reference:
7
7
[1] Qu Y, Cai H, Ren K, et al. Product-based neural networks for user response prediction[C]//Data Mining (ICDM), 2016 IEEE 16th International Conference on. IEEE, 2016: 1149-1154.(https://arxiv.org/pdf/1611.00144.pdf)
Copy file name to clipboardexpand all lines: deepctr/estimator/models/wdl.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
# -*- coding:utf-8 -*-
2
2
"""
3
3
Author:
4
-
Weichen Shen, wcshen1994@163.com
4
+
Weichen Shen, weichenswc@163.com
5
5
6
6
Reference:
7
7
[1] Cheng H T, Koc L, Harmsen J, et al. Wide & deep learning for recommender systems[C]//Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. ACM, 2016: 7-10.(https://arxiv.org/pdf/1606.07792.pdf)
""" Simplified version of Transformer proposed in 《Attention is all you need》
418
418
419
419
Input shape
420
-
- a list of two 3D tensor with shape ``(batch_size, timesteps, input_dim)`` if supports_masking=True.
421
-
- a list of two 4 tensors, first two tensors with shape ``(batch_size, timesteps, input_dim)``,last two tensors with shape ``(batch_size, 1)`` if supports_masking=False.
420
+
- a list of two 3D tensor with shape ``(batch_size, timesteps, input_dim)`` if ``supports_masking=True`` .
421
+
- a list of two 4 tensors, first two tensors with shape ``(batch_size, timesteps, input_dim)``,last two tensors with shape ``(batch_size, 1)`` if ``supports_masking=False`` .
422
422
423
423
424
424
Output shape
425
-
- 3D tensor with shape: ``(batch_size, 1, input_dim)``.
425
+
- 3D tensor with shape: ``(batch_size, 1, input_dim)`` if ``output_type='mean'`` or ``output_type='sum'`` , else ``(batch_size, timesteps, input_dim)`` .
426
426
427
427
428
428
Arguments
@@ -436,14 +436,16 @@ class Transformer(Layer):
436
436
- **blinding**: bool. Whether or not use blinding.
437
437
- **seed**: A Python integer to use as random seed.
438
438
- **supports_masking**:bool. Whether or not support masking.
439
+
- **attention_type**: str, Type of attention, the value must be one of { ``'scaled_dot_product'`` , ``'additive'`` }.
440
+
- **output_type**: ``'mean'`` , ``'sum'`` or `None`. Whether or not use average/sum pooling for output.
439
441
440
442
References
441
443
- [Vaswani, Ashish, et al. "Attention is all you need." Advances in Neural Information Processing Systems. 2017.](https://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf)
Copy file name to clipboardexpand all lines: deepctr/models/afm.py
+1-1
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
"""
3
3
4
4
Author:
5
-
Weichen Shen, wcshen1994@163.com
5
+
Weichen Shen, weichenswc@163.com
6
6
7
7
Reference:
8
8
[1] Xiao J, Ye H, He X, et al. Attentional factorization machines: Learning the weight of feature interactions via attention networks[J]. arXiv preprint arXiv:1708.04617, 2017.
0 commit comments