Post-transformer inference: 224× compression of Llama-70B with improved accuracy (zenodo.org)
14 points by anima-core 2 hours ago | 6 comments
6114 points by anima-core 2 hours ago | 6 comments
612 points by indigodaddy 2 hours ago | 1 comment
622 points by skilldeliver 2 hours ago | 0 comments
633 points by zerosizedweasle 2 hours ago | 0 comments
642 points by mfiguiere 2 hours ago | 0 comments
653 points by andsoitis 2 hours ago | 2 comments
662 points by tank-34 2 hours ago | 1 comment
671 points by Jeannen 2 hours ago | 0 comments
685 points by tanelpoder 2 hours ago | 0 comments
692 points by thunderbong 3 hours ago | 0 comments
702 points by BSTRhino 3 hours ago | 1 comment
716 points by donohoe 3 hours ago | 1 comment
721 points by jjuliobit 3 hours ago | 0 comments
733 points by mgh2 3 hours ago | 0 comments
742 points by mgh2 3 hours ago | 0 comments
7528 points by hdk 3 hours ago | 18 comments
768 points by mhb 3 hours ago | 0 comments
771 points by malachi_dev 3 hours ago | 1 comment
781 points by deathnail298 3 hours ago | 1 comment
792 points by handfuloflight 3 hours ago | 0 comments
803 points by jjgreen 3 hours ago | 0 comments
811 points by olivato 3 hours ago | 0 comments
821 points by birdculture 3 hours ago | 0 comments
831 points by transpute 3 hours ago | 0 comments
844 points by mhb 3 hours ago | 1 comment
851 points by jbuse 3 hours ago | 0 comments
861 points by maoaeiou 3 hours ago | 1 comment
871 points by heihieih 3 hours ago | 0 comments
882 points by fcpguru 3 hours ago | 0 comments
893 points by nimbius 3 hours ago | 0 comments
90