Granite Time Series Collection Time series models for forecasting, anomaly detection, classification, and more. • 10 items • Updated about 6 hours ago • 51
Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers Paper • 2311.10642 • Published Nov 17, 2023 • 25