Introducing FalconMamba 7B: An attention-free 7B model which is pretty strong!
🤯Can process unlimited sequence lengths, outperforms traditional models, and fit on a single 24GB GPU.
Open-source and available on HF🤗. FalconMamba-7b Gradio Demo:https://huggingface.co/spaces/tiiuae/falcon-mamba-playground
🤯Can process unlimited sequence lengths, outperforms traditional models, and fit on a single 24GB GPU.
Open-source and available on HF🤗. FalconMamba-7b Gradio Demo:https://huggingface.co/spaces/tiiuae/falcon-mamba-playground