Noa Roggendorff
nroggendorff's activity

Ah, we can work with that. Then the issue is that the Space is incomplete/misconfigured (I would recommend amending your original post to avoid confusion).
I just read your blog post:
https://huggingface.co/blog/nroggendorff/train-with-llama-architecture
It provides some useful context, thanks.
From reading the Dockerfile and image file, it appears that CUDA was never included in the image.
You may find the following resources helpful for using Docker with Spaces:
https://huggingface.co/docs/hub/en/spaces-sdks-docker
If you are using CUDA, this may also help inform how to set up CUDA, and also test whether CUDA works (with Docker):
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html
References:
https://huggingface.co/spaces/nroggendorff/train-llama/blob/main/Dockerfile
Hope you find this helpful. Let me know if you have any more questions, here or by email.

The base image for that Dockerfile has CUDA installed and configured.
You are welcome to open a PR with your proposed fix at https://github.com/nroggendorff/train-llama.
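For testing whether CUDA is reachable from inside a container, something like the following sketch can help. This is my own illustration, not code from the Space; the helper name `cuda_visible` is hypothetical, and it only checks that the NVIDIA driver tooling (`nvidia-smi`) is present and working, not that any particular framework can use the GPU.

```python
import shutil
import subprocess

def cuda_visible() -> bool:
    """Best-effort check that the NVIDIA driver is reachable in this environment.

    Returns True only if `nvidia-smi` is on PATH and exits successfully.
    A False result inside a Docker Space suggests the image or runtime
    was not configured with GPU/CUDA support.
    """
    smi = shutil.which("nvidia-smi")
    if smi is None:
        return False
    try:
        result = subprocess.run([smi], capture_output=True, timeout=10)
        return result.returncode == 0
    except (OSError, subprocess.TimeoutExpired):
        return False

if __name__ == "__main__":
    print("CUDA driver visible:", cuda_visible())
```

Running this inside the container (rather than on the host) is the quick way to tell whether the image itself, versus the app code, is the problem.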


I am not sure that makes sense; I am under the impression that, if the Space is not running (not started), no models can be actively loaded in the Space.
Can you share your relevant workflow (docker-compose, app code, etc.) so I can see more clearly what's happening?
I might be able to aid in a solution; it's possible that there is an issue in the workflow itself.
EDIT: I looked at the Spaces. Do you mean this Space as an example? https://huggingface.co/spaces/nroggendorff/train-llama
That Space shows a missing "CUDA_HOME" env var, and most of your other Spaces are throwing errors about missing CUDA drivers or are paused. These are configuration errors.
Could you tell me the Space and error message? I might be able to help you fix it.
That's the one.

what the~

It's pretty specific to my workflow, but Spaces now don't get CUDA until after they start, so you can't load models or anything until an app is running.
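One common way to cope with CUDA only becoming available after startup is to defer model loading until first use. This is a generic sketch under my own assumptions, not nroggendorff's actual workflow; the `LazyModel` class and its `loader` parameter are illustrative names.

```python
import threading

class LazyModel:
    """Load an expensive resource on first use, after the app has started.

    `loader` is any zero-argument callable; in a real Space it might build a
    model on the GPU, which only works once CUDA is available at runtime.
    The lock makes the first load safe under concurrent requests.
    """
    def __init__(self, loader):
        self._loader = loader
        self._lock = threading.Lock()
        self._model = None

    def get(self):
        with self._lock:
            if self._model is None:
                self._model = self._loader()
            return self._model
```

Wiring this into a web app means the server binds its port immediately, and the (GPU-dependent) load happens on the first request instead of at import time.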


403

It's a similar architecture to image generation, so... kinda? Diffusion LLMs aren't very popular though, so there isn't a ton of research on them. Transformers are a much more reliable model type for now.
edit: it's not really a super serious experiment; they are more for testing whether a logical response is possible this way.
This is also kind of why Q&A bots are really bad; people just found that that format doesn't scale very well at all.
edit 2: (I said one of, because another huge reason is quality-data scarcity and lack of flexibility. With incremental models like GPTs, you can have any number of roles and such, whereas input-output models just have that one format.)