I built a Space for creating prompts for FLUX:

gokaygokay/FLUX-Prompt-Generator


You can create long prompts from images or simple words, and enhance your short prompts with the prompt enhancer. You can configure various settings such as artform, photo type, character details, scene details, style, and artist to create tailored prompts.

And you can combine all of them with custom prompts using LLMs (Mixtral, Mistral, Llama 3, and Mistral-Nemo).

The UI is a bit complex, but it includes almost everything you need. Choosing the random option is the most fun!
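Just to illustrate the LLM-enhancement idea (this is not the Space's actual code; the model ID and prompt below are only examples), a minimal sketch with huggingface_hub's InferenceClient could look like this:

```python
from huggingface_hub import InferenceClient

# Hypothetical illustration of the prompt-enhancement step,
# not the actual code behind gokaygokay/FLUX-Prompt-Generator.
client = InferenceClient("mistralai/Mistral-7B-Instruct-v0.3")

short_prompt = "a cat in a spacesuit"  # example input
messages = [
    {"role": "system", "content": "Expand short image prompts into detailed FLUX prompts."},
    {"role": "user", "content": short_prompt},
]

response = client.chat_completion(messages, max_tokens=200)
print(response.choices[0].message.content)
```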

And I've created some other Spaces for using FLUX models with captioners and enhancers.

- gokaygokay/FLUX.1-dev-with-Captioner
- gokaygokay/FLUX.1-Schnell-with-Captioner
New feature 🔥
Image models and LoRAs now have little previews 🤏

If you don't know where to start to find them, I invite you to browse cool LoRAs in the profile of some amazing fine-tuners: @artificialguybr, @alvdansen, @DoctorDiffusion, @e-n-v-y, @KappaNeuro @ostris
The first open Stable Diffusion 3-like architecture model is JUST out 💣 - but it is not SD3! 🤔

It is
Tencent-Hunyuan/HunyuanDiT
by Tencent, a 1.5B-parameter DiT (diffusion transformer) text-to-image model 🖼, trained with multilingual CLIP + multilingual T5 text encoders for English 🤝 Chinese understanding

Try it out by yourself here ▶️ https://huggingface.co/spaces/multimodalart/HunyuanDiT
(a bit too slow as the model is chunky and the research code isn't super optimized for inference speed yet)
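If you'd rather run it locally, here is a minimal sketch assuming the diffusers integration and the diffusers-format checkpoint (the repo id and settings are illustrative, and you'll need a GPU with enough memory):

```python
import torch
from diffusers import HunyuanDiTPipeline  # assumes a diffusers release that ships this pipeline

pipe = HunyuanDiTPipeline.from_pretrained(
    "Tencent-Hunyuan/HunyuanDiT-Diffusers", torch_dtype=torch.float16
)
pipe.to("cuda")

# The text encoders are bilingual, so English and Chinese prompts both work
image = pipe(prompt="an astronaut riding a horse, ink painting style").images[0]
image.save("astronaut.png")
```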

In the paper they claim to be SOTA among open-source models based on human preference evaluation!
🥳 Celebrating 5K readers on one of my blog posts 🥳
I came back with another one this time 🤓
In this blog you will learn 📖:
* How to train custom AI models with the Trainer API 🚀
* How to integrate your AI models with HF using the mixin classes 🔥

happy reading everyone 🤗
🔗 Link: https://huggingface.co/blog/not-lain/trainer-api-and-mixin-classes
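As a small taste of what the post covers, here is a generic Trainer sketch (the model and dataset below are placeholders, not the ones used in the blog):

```python
from datasets import load_dataset
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

# Placeholder model and dataset, just to show the Trainer API pattern
model_name = "distilbert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=2)

dataset = load_dataset("imdb")
tokenized = dataset.map(lambda batch: tokenizer(batch["text"], truncation=True), batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=1, per_device_train_batch_size=8),
    train_dataset=tokenized["train"].shuffle(seed=42).select(range(1000)),  # small subset for a quick run
    tokenizer=tokenizer,  # enables dynamic padding via the default data collator
)
trainer.train()
```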
I will be delivering an introductory coding session about Hugging Face this Sunday at 7 PM GMT+1. If you are new to HF and don't know where to begin, you are welcome to join us 🤗
📌 Place: the Hugging Face Discord server
🔗 Link: https://discord.gg/hugging-face-879548962464493619?event=1245406127668203541
It is with great pleasure that I inform you that Hugging Face's ModelHubMixin has reached 200+ models on the Hub 🥳

ModelHubMixin is a class developed by HF to integrate AI models with the Hub with ease, and it comes with 3 methods (a minimal usage sketch follows the list):
* save_pretrained
* from_pretrained
* push_to_hub
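Here is that minimal sketch, using PyTorchModelHubMixin with an illustrative model and repo id (not any specific model from the Hub):

```python
import torch.nn as nn
from huggingface_hub import PyTorchModelHubMixin

class MyModel(nn.Module, PyTorchModelHubMixin):
    def __init__(self, hidden_size: int = 128):
        super().__init__()
        self.linear = nn.Linear(hidden_size, hidden_size)

    def forward(self, x):
        return self.linear(x)

model = MyModel(hidden_size=128)

# The three methods added by the mixin:
model.save_pretrained("my-model")                        # save weights + config locally
model.push_to_hub("your-username/my-model")              # upload to the Hub (illustrative repo id)
reloaded = MyModel.from_pretrained("your-username/my-model")
```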

Shoutout to @nielsr , @Wauplin and everyone else on HF for their awesome work 🤗

If you are not familiar with ModelHubMixin and are looking for extra resources, you might consider:
* docs: https://huggingface.co/docs/huggingface_hub/main/en/package_reference/mixins
🔗blog about training models with the trainer API and using ModelHubMixin: https://huggingface.co/blog/not-lain/trainer-api-and-mixin-classes
🔗GitHub repo with pip integration: https://github.com/not-lain/PyTorchModelHubMixin-template
🔗basic guide: https://huggingface.co/posts/not-lain/884273241241808
I have finished writing a blog post about building an image-based retrieval system. This is one of the first-ever approaches to building such a pipeline using only open-source models/libraries 🤗

You can check out the blog post at https://huggingface.co/blog/not-lain/image-retriever and the associated Space at not-lain/image-retriever.
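The blog's exact pipeline may differ, but the general idea can be sketched with open-source tools like sentence-transformers (a CLIP model for embeddings plus cosine-similarity search; the file paths below are placeholders):

```python
from PIL import Image
from sentence_transformers import SentenceTransformer, util

# CLIP model that embeds images and text into the same space
model = SentenceTransformer("clip-ViT-B-32")

# Placeholder image paths standing in for an indexed collection
image_paths = ["cat.jpg", "dog.jpg", "car.jpg"]
image_embeddings = model.encode([Image.open(p) for p in image_paths], convert_to_tensor=True)

# Query with an image; a text query works the same way, e.g. model.encode("a photo of a cat")
query_embedding = model.encode(Image.open("query.jpg"), convert_to_tensor=True)
hits = util.semantic_search(query_embedding, image_embeddings, top_k=3)[0]

for hit in hits:
    print(image_paths[hit["corpus_id"]], hit["score"])
```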

If you want to request another blog post, let me know down below, or reach out to me through any of my social media.

📖 Happy reading!
AI Comic Factory
Last release: AI Comic Factory 1.2

The AI Comic Factory will soon have an official website: aicomicfactory.app

For more information about my other projects please check linktr.ee/FLNGR.

Running the project at home
First, I would like to highlight that everything is open-source (see here, here, here, here).

However, the project isn't a monolithic Space that can be duplicated and run immediately: it requires various components for the frontend, backend, LLM, SDXL, etc.

If you try to duplicate the project and open the .env file, you will see that it requires some variables.
distilabel 1.3.0 is out! This release contains many core improvements and new tasks that helped us build argilla/magpie-ultra-v0.1!

Distributed pipeline execution with Ray, new Magpie tasks, reward models, components for dataset diversity based on sentence embeddings, Argilla 2.0 compatibility and many more features!
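As a rough idea of what a distilabel pipeline looks like (this is a generic sketch, not the magpie-ultra recipe; the dataset repo ids are hypothetical and the input dataset is assumed to have an "instruction" column):

```python
from distilabel.llms import InferenceEndpointsLLM
from distilabel.pipeline import Pipeline
from distilabel.steps import LoadDataFromHub
from distilabel.steps.tasks import TextGeneration

with Pipeline(name="demo-pipeline") as pipeline:
    # Hypothetical dataset repo with an "instruction" column
    load_data = LoadDataFromHub(repo_id="your-username/instructions-dataset", split="train")
    generate = TextGeneration(
        llm=InferenceEndpointsLLM(model_id="meta-llama/Meta-Llama-3.1-8B-Instruct"),
    )
    load_data >> generate

if __name__ == "__main__":
    distiset = pipeline.run()
    distiset.push_to_hub("your-username/generated-dataset")  # illustrative repo id
```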

Check the new release in GitHub: https://github.com/argilla-io/distilabel

Remember when @mistralAI said "large enough" and casually dropped Mistral-Large-Instruct-2407? 🤯🚀

It's now on http://lmsys.org! 🌐 It works amazingly well for instruction following, hard prompts, coding, and longer queries, with only 123 billion parameters. 💡💻

It outperforms GPT-4 Turbo and Claude 3 Opus in the Coding, Hard Prompts, Math, and Longer Query categories. 📈🔢

It also outperforms Llama 3.1 405B on Instruction Following while being 3x smaller. 🐎🔍

It also does exceedingly well on the AI2 ZebraLogic logical reasoning benchmark despite being much smaller than the other models. 🦓🤔

Mistral is not here to take part but to take over! 🏆🌟

Model: https://mistral.ai/news/mistral-large-2407/
JoseRFJunior/TransNAR

https://github.com/JoseRFJuniorLLMs/TransNAR
https://arxiv.org/html/2406.09308v1
TransNAR hybrid architecture. Similar to Alayrac et al., we interleave existing Transformer layers with gated cross-attention layers which enable information to flow from the NAR to the Transformer. We generate queries from tokens, while we obtain keys and values from the nodes and edges of the graph. The node and edge embeddings are obtained by running the NAR on the graph version of the reasoning task to be solved. When experimenting with pre-trained Transformers, we initially close the cross-attention gate, in order to fully preserve the language model's internal knowledge at the beginning of training.
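The gated cross-attention idea is easy to sketch in PyTorch; the block below is a generic Flamingo-style layer, not the authors' code, and all dimensions are illustrative:

```python
import torch
import torch.nn as nn

class GatedCrossAttention(nn.Module):
    """Generic gated cross-attention block: token queries attend to the NAR's
    node/edge embeddings, and a tanh gate (initialized at zero) keeps the
    pre-trained language model unchanged at the start of training."""

    def __init__(self, dim: int = 512, num_heads: int = 8):
        super().__init__()
        self.norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.gate = nn.Parameter(torch.zeros(1))  # tanh(0) = 0: cross-attention starts closed

    def forward(self, tokens: torch.Tensor, graph_embeddings: torch.Tensor) -> torch.Tensor:
        # Queries come from the Transformer's token stream,
        # keys and values from the NAR's node/edge embeddings.
        attended, _ = self.attn(self.norm(tokens), graph_embeddings, graph_embeddings)
        return tokens + torch.tanh(self.gate) * attended

# Illustrative shapes: batch of 2, 16 tokens, 32 graph nodes/edges, width 512
tokens = torch.randn(2, 16, 512)
graph = torch.randn(2, 32, 512)
print(GatedCrossAttention()(tokens, graph).shape)  # torch.Size([2, 16, 512])
```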
🔥 A new state-of-the-art model for background removal is out
🤗 You can try the model at
ZhengPeng7/BiRefNet

📈 The model shows impressive results, outperforming
briaai/RMBG-1.4

🚀 You can try out the model in:
ZhengPeng7/BiRefNet_demo
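You can also run it locally. A minimal sketch using transformers' remote-code loading (I'm assuming the usual AutoModelForImageSegmentation + trust_remote_code pattern; the pre/post-processing is schematic and the file paths are placeholders):

```python
import torch
from PIL import Image
from torchvision import transforms
from transformers import AutoModelForImageSegmentation

# BiRefNet ships custom modeling code, hence trust_remote_code=True
model = AutoModelForImageSegmentation.from_pretrained("ZhengPeng7/BiRefNet", trust_remote_code=True)
model.eval()

preprocess = transforms.Compose([
    transforms.Resize((1024, 1024)),
    transforms.ToTensor(),
    transforms.Normalize([0.485, 0.456, 0.406], [0.229, 0.224, 0.225]),
])

image = Image.open("input.jpg").convert("RGB")  # placeholder path
with torch.no_grad():
    # Assumption: the last element of the model output holds the final mask logits
    mask = model(preprocess(image).unsqueeze(0))[-1].sigmoid().squeeze().cpu()

# Use the predicted mask as an alpha channel to cut out the foreground
image.putalpha(transforms.ToPILImage()(mask).resize(image.size))
image.save("output.png")
```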


📃paper:
Bilateral Reference for High-Resolution Dichotomous Image Segmentation (2401.03407)
PyTorch implementation of the Self-Compression & Differentiable Quantization algorithm introduced in the “Self-Compressing Neural Networks” paper.

The algorithm performs dynamic neural network compression during training, reducing the size of the weight and activation tensors and the number of bits required to represent the weights.

It basically shrinks the neural network (weights and activations) as it is being trained without compromising performance, which helps reduce compute and inference cost.
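Not the repo's exact code, but the paper's core trick can be sketched like this: weights are quantized with a learnable bit-depth b and exponent e (rounding goes through a straight-through estimator), and the average bit count is added to the loss so the network learns to compress itself. The layer below and the 0.01 weighting are illustrative:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SelfCompressingLinear(nn.Module):
    """Linear layer with weights quantized as q(x) = 2^e * clip(round(x / 2^e), -2^(b-1), 2^(b-1) - 1),
    where bit-depth b and exponent e are learned per output channel.
    Rounding uses a straight-through estimator so b and e stay trainable."""

    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.02)
        self.bias = nn.Parameter(torch.zeros(out_features))
        self.b = nn.Parameter(torch.full((out_features, 1), 8.0))   # learnable bit-depth
        self.e = nn.Parameter(torch.full((out_features, 1), -4.0))  # learnable exponent

    def quantized_weight(self) -> torch.Tensor:
        scale = torch.exp2(self.e)
        x = self.weight / scale
        x = x + (torch.round(x) - x).detach()  # straight-through round
        qmin, qmax = -torch.exp2(self.b - 1), torch.exp2(self.b - 1) - 1
        return scale * torch.maximum(torch.minimum(x, qmax), qmin)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return F.linear(x, self.quantized_weight(), self.bias)

    def bit_loss(self) -> torch.Tensor:
        # Average bits per weight; adding it to the task loss pushes b (and the model size) down
        return torch.relu(self.b).mean()

layer = SelfCompressingLinear(64, 32)
task_loss = layer(torch.randn(4, 64)).pow(2).mean()   # placeholder task loss
loss = task_loss + 0.01 * layer.bit_loss()            # 0.01 compression weight is illustrative
loss.backward()
```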


Code: https://github.com/Jaykef/ai-algorithms
Paper: https://arxiv.org/pdf/2301.13142