
Setting up the mamba and mamba2 environment

Commands for installing mamba and mamba2:

conda create -n mamba_test python=3.10
conda activate mamba_test
conda install cudatoolkit=11.8 -c https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main/linux-64/
pip install mamba_ssm-2.2.2+cu118torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
pip install causal_conv1d-1.4.0+cu118torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whl
pip install triton==2.1.0
pip install numpy==1.22.4
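
The wheel filenames above encode the CUDA version, the PyTorch version, the C++ ABI flag and the CPython tag, and each of these has to match the environment the wheel is installed into. Below is a minimal sketch of how one might read those tags out of a filename before installing. `parse_wheel_name` and `python_tag` are hypothetical helpers written for this post, not part of pip, mamba_ssm or causal_conv1d, and the regex assumes the filename layout shown in the commands above.

```python
import re
import sys

# Assumed layout (taken from the filenames in this post):
#   <name>-<version>+cu<CUDA>torch<TORCH>cxx11abi<TRUE|FALSE>-<py>-<abi>-<platform>.whl
WHEEL_RE = re.compile(
    r"^(?P<name>[^-]+)-(?P<version>[^-+]+)"
    r"\+cu(?P<cuda>\d+)torch(?P<torch>[\d.]+)cxx11abi(?P<cxx11abi>TRUE|FALSE)"
    r"-(?P<py>cp\d+)-(?P<pyabi>cp\d+)-(?P<platform>.+)\.whl$"
)


def parse_wheel_name(filename):
    """Split a prebuilt wheel filename into its build tags (hypothetical helper)."""
    match = WHEEL_RE.match(filename)
    if match is None:
        raise ValueError(f"unrecognised wheel filename: {filename}")
    return match.groupdict()


def python_tag():
    """CPython tag of the running interpreter, e.g. 'cp310' for Python 3.10."""
    return f"cp{sys.version_info.major}{sys.version_info.minor}"


if __name__ == "__main__":
    wheel = "mamba_ssm-2.2.2+cu118torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whl"
    info = parse_wheel_name(wheel)
    print(info)
    if info["py"] != python_tag():
        print(f"wheel targets {info['py']}, interpreter is {python_tag()}")
```

In the conda environment created above with python=3.10, `python_tag()` gives `cp310`, which matches both wheels. Once PyTorch is installed, the `cuda` and `torch` fields can be compared by hand against `torch.version.cuda` and `torch.__version__`.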

Download locations for the corresponding whl files:
mamba_ssm download
causal_conv1d download

Working test code for mamba and mamba2:

import torch
from mamba_ssm import Mamba

batch, length, dim = 2, 64, 16
x = torch.randn(batch, length, dim).to("cuda")
model = Mamba(
    # This module uses roughly 3 * expand * d_model^2 parameters
    d_model=dim,  # Model dimension d_model
    d_state=16,  # SSM state expansion factor
    d_conv=4,  # Local convolution width
    expand=2,  # Block expansion factor
).to("cuda")
y = model(x)
print("Mamba result", y.shape)
assert y.shape == x.shape

import torch
from mamba_ssm import Mamba2

batch, length, dim = 2, 64, 512
x = torch.randn(batch, length, dim).to("cuda")
model = Mamba2(
    # This module uses roughly 3 * expand * d_model^2 parameters
    # make sure d_model * expand / headdim = multiple of 8
    d_model=dim,  # Model dimension d_model
    d_state=64,  # SSM state expansion factor, typically 64 or 128
    d_conv=4,  # Local convolution width
    expand=2,  # Block expansion factor
    headdim=64,  # default 64
).to("cuda")
y = model(x)
print("Mamba2 result", y.shape)
assert y.shape == x.shape
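
The comments in the test code state two rules of thumb: a block uses roughly 3 * expand * d_model^2 parameters, and for Mamba2 the value d_model * expand / headdim should be a multiple of 8. The sketch below only restates that arithmetic so a configuration can be checked without a GPU. Both function names are hypothetical and are not part of mamba_ssm; the estimate is the rough figure from the comment, not an exact count.

```python
def approx_mamba_params(d_model, expand=2):
    """Rough parameter estimate from the code comment: 3 * expand * d_model^2."""
    return 3 * expand * d_model ** 2


def mamba2_heads_ok(d_model, expand=2, headdim=64):
    """Check the rule from the code comment: d_model * expand / headdim
    should be a whole number that is a multiple of 8."""
    d_inner = d_model * expand
    if d_inner % headdim != 0:
        return False
    return (d_inner // headdim) % 8 == 0


if __name__ == "__main__":
    # The two configurations used in the test code above.
    print("Mamba  (d_model=16): ", approx_mamba_params(16))
    print("Mamba2 (d_model=512):", approx_mamba_params(512))
    print("Mamba2 head rule satisfied:", mamba2_heads_ok(512, expand=2, headdim=64))
```

To see how close the estimate is for a real model, it can be compared with `sum(p.numel() for p in model.parameters())` after constructing the module.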

Debugging references:
Mamba-2 Error: 'NoneType' object has no attribute 'causal_conv1d_fwd'

mamba_ssm and causal-conv1d installation tutorial
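
An error such as 'NoneType' object has no attribute 'causal_conv1d_fwd' usually suggests that an optional compiled dependency failed to import and was silently replaced with None. Here is a minimal diagnostic sketch that tries each import and reports the real exception. `probe_modules` is a hypothetical helper, and the module name `causal_conv1d_cuda` for the compiled extension is an assumption about how the causal-conv1d package is laid out; adjust the list if your installed version differs.

```python
import importlib


def probe_modules(names=("torch", "causal_conv1d", "causal_conv1d_cuda", "mamba_ssm")):
    """Try to import each module; map its name to None on success or to the
    error text on failure. torch comes first because the compiled extensions
    are assumed to need it loaded beforehand."""
    report = {}
    for name in names:
        try:
            importlib.import_module(name)
            report[name] = None
        except Exception as exc:  # broad on purpose: loader errors vary
            report[name] = f"{type(exc).__name__}: {exc}"
    return report


if __name__ == "__main__":
    for name, error in probe_modules().items():
        print(f"{name:20s} {'OK' if error is None else error}")
```

If one of the causal_conv1d entries fails while `torch` imports fine, the wheel most likely does not match the installed CUDA, PyTorch or Python version, and reinstalling a matching wheel is the first thing to try.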


