Human Guided Ground-truth Generation for Realistic Image Super-resolution
π₯ Github: https://github.com/chrisdud0257/hggt
β© Paper: https://arxiv.org/abs/2303.13069
π¨ Dataset: https://paperswithcode.com/dataset/div2k
https://t.iss.one/DataScienceT
π₯ Github: https://github.com/chrisdud0257/hggt
β© Paper: https://arxiv.org/abs/2303.13069
π¨ Dataset: https://paperswithcode.com/dataset/div2k
https://t.iss.one/DataScienceT
β€2β€βπ₯1
ImageNet-E: Benchmarking Neural Network Robustness via Attribute Editing
π₯ Github: https://github.com/alibaba/easyrobust/tree/main/benchmarks/imagenet-e
β© Paper: https://arxiv.org/abs/2303.17096v1
π¨ Dataset: https://paperswithcode.com/dataset/objectnet
https://t.iss.one/DataScienceT
π₯ Github: https://github.com/alibaba/easyrobust/tree/main/benchmarks/imagenet-e
β© Paper: https://arxiv.org/abs/2303.17096v1
π¨ Dataset: https://paperswithcode.com/dataset/objectnet
https://t.iss.one/DataScienceT
β€βπ₯4
β‘οΈToken Merging for Stable Diffusion
Token Merging (ToMe) speeds up transformers by merging redundant tokens, which means the transformer has to do less work.
π₯ Github: https://github.com/dbolya/tomesd
β© Paper: https://arxiv.org/abs/2303.17604v1
π¨ Blog: https://research.facebook.com/blog/2023/2/token-merging-your-vit-but-faster/
https://t.iss.one/DataScienceT
Token Merging (ToMe) speeds up transformers by merging redundant tokens, which means the transformer has to do less work.
pip install tomesd
π₯ Github: https://github.com/dbolya/tomesd
β© Paper: https://arxiv.org/abs/2303.17604v1
π¨ Blog: https://research.facebook.com/blog/2023/2/token-merging-your-vit-but-faster/
https://t.iss.one/DataScienceT
β€βπ₯1
βοΈ HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace
Language serves as an interface for LLMs to connect numerous AI models for solving complicated AI tasks!
π₯ Github: https://github.com/microsoft/JARVIS
β© Paper: https://arxiv.org/abs/2303.17604v1
https://t.iss.one/DataScienceT
Language serves as an interface for LLMs to connect numerous AI models for solving complicated AI tasks!
π₯ Github: https://github.com/microsoft/JARVIS
β© Paper: https://arxiv.org/abs/2303.17604v1
https://t.iss.one/DataScienceT
π4
WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation
π₯ Github: https://github.com/hustvl/weaktr
β© Paper: https://arxiv.org/abs/2304.01184v1
π¨ Dataset: https://paperswithcode.com/dataset/imagenet
https://t.iss.one/DataScienceT
π₯ Github: https://github.com/hustvl/weaktr
β© Paper: https://arxiv.org/abs/2304.01184v1
π¨ Dataset: https://paperswithcode.com/dataset/imagenet
https://t.iss.one/DataScienceT
β€βπ₯2
This media is not supported in your browser
VIEW IN TELEGRAM
Test of Time: Instilling Video-Language Models with a Sense of Time
GPT-5 will likely have video abilities, but will it have a sense of time? Here is answer to this question in #CVPR2023 paper by student of University of Amsterdam to learn how to instil time into video-language foundation models.
Paper:
https://arxiv.org/abs/2301.02074
Code:
https://github.com/bpiyush/TestOfTime
Project Page:
https://bpiyush.github.io/testoftime-website/
https://t.iss.one/DataScienceT
GPT-5 will likely have video abilities, but will it have a sense of time? Here is answer to this question in #CVPR2023 paper by student of University of Amsterdam to learn how to instil time into video-language foundation models.
Paper:
https://arxiv.org/abs/2301.02074
Code:
https://github.com/bpiyush/TestOfTime
Project Page:
https://bpiyush.github.io/testoftime-website/
https://t.iss.one/DataScienceT
β€βπ₯4
This media is not supported in your browser
VIEW IN TELEGRAM
Segment Anything
The Segment Anything Model (SAM) produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image.
π₯ Github: https://github.com/facebookresearch/segment-anything
βοΈ Project: https://segment-anything.com/
β© Paper: https://arxiv.org/abs/2304.02643v1
π¨ Dataset: https://segment-anything.com/dataset/index.html
https://t.iss.one/DataScienceT
The Segment Anything Model (SAM) produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image.
π₯ Github: https://github.com/facebookresearch/segment-anything
βοΈ Project: https://segment-anything.com/
β© Paper: https://arxiv.org/abs/2304.02643v1
π¨ Dataset: https://segment-anything.com/dataset/index.html
https://t.iss.one/DataScienceT
β€βπ₯5
Media is too big
VIEW IN TELEGRAM
Painter β SegGPT: Vision Foundation Models from BAAI
SegGPT, a generalist model for segmenting everything in context.
π₯ Github: https://github.com/baaivision/painter
β© Paper: https://arxiv.org/abs/2304.03284v1
β© Demo: https://huggingface.co/spaces/BAAI/SegGPT
π¨ Dataset: https://paperswithcode.com/dataset/youtube-vos
https://t.iss.one/DataScienceT
SegGPT, a generalist model for segmenting everything in context.
π₯ Github: https://github.com/baaivision/painter
β© Paper: https://arxiv.org/abs/2304.03284v1
β© Demo: https://huggingface.co/spaces/BAAI/SegGPT
π¨ Dataset: https://paperswithcode.com/dataset/youtube-vos
https://t.iss.one/DataScienceT
β€βπ₯3π1
To watch paid channel content
All you have to do is subscribe to the paid channel. The paid channel includes multiple and huge programming courses, in addition to very useful books that are not available for free except in the paid channel.
To request a subscription: talk to @Hussein_Sheikho
Channel link: https://t.iss.one/+LnCmAFJO3tNmYjUy
βοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈ
We launched a special bot some time ago to download all scientific, software and mathematics books The bot contains more than thirty million books, and new books are downloaded first, In addition to the possibility of downloading all articles and scientific papers for free
To request a subscription: talk to @Hussein_Sheikho
All you have to do is subscribe to the paid channel. The paid channel includes multiple and huge programming courses, in addition to very useful books that are not available for free except in the paid channel.
To request a subscription: talk to @Hussein_Sheikho
Channel link: https://t.iss.one/+LnCmAFJO3tNmYjUy
βοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈβοΈ
We launched a special bot some time ago to download all scientific, software and mathematics books The bot contains more than thirty million books, and new books are downloaded first, In addition to the possibility of downloading all articles and scientific papers for free
To request a subscription: talk to @Hussein_Sheikho
β€βπ₯2
Instruction Tuning with GPT-4
First attempt to use GPT-4 to generate instruction-following data for LLM finetuning.
π₯ Github: https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM
β© Paper: https://arxiv.org/abs/2304.03277v1
β© Project: https://instruction-tuning-with-gpt-4.github.io/
https://t.iss.one/DataScienceT
First attempt to use GPT-4 to generate instruction-following data for LLM finetuning.
π₯ Github: https://github.com/Instruction-Tuning-with-GPT-4/GPT-4-LLM
β© Paper: https://arxiv.org/abs/2304.03277v1
β© Project: https://instruction-tuning-with-gpt-4.github.io/
https://t.iss.one/DataScienceT
π3
βοΈ OpenAGI: When LLM Meets Domain Experts
Reinforcement Learning from Task Feedback (RLTF) mechanism, which uses the task-solving result as feedback to improve the LLM's task-solving ability
git clone https://github.com/agiresearch/OpenAGI.git
π₯ Github: https://github.com/agiresearch/openagi
β© Paper: https://arxiv.org/pdf/2304.04370.pdf
βοΈ Dataset: https://drive.google.com/drive/folders/1AjT6y7qLIMxcmHhUBG5IE1_5SnCPR57e?usp=share_link
https://t.iss.one/DataScienceT
Reinforcement Learning from Task Feedback (RLTF) mechanism, which uses the task-solving result as feedback to improve the LLM's task-solving ability
git clone https://github.com/agiresearch/OpenAGI.git
π₯ Github: https://github.com/agiresearch/openagi
β© Paper: https://arxiv.org/pdf/2304.04370.pdf
βοΈ Dataset: https://drive.google.com/drive/folders/1AjT6y7qLIMxcmHhUBG5IE1_5SnCPR57e?usp=share_link
https://t.iss.one/DataScienceT
β€βπ₯3π1
βοΈ Hard Patches Mining for Masked Image Modeling
We observe that the reconstruction loss can naturally be the metric of the difficulty of the pre-training task.
π₯ Github: https://github.com/haochen-wang409/hpm
β© Paper: https://arxiv.org/abs/2304.05919v1
βοΈ Dataset: https://paperswithcode.com/dataset/ade20k
https://t.iss.one/DataScienceT
We observe that the reconstruction loss can naturally be the metric of the difficulty of the pre-training task.
π₯ Github: https://github.com/haochen-wang409/hpm
β© Paper: https://arxiv.org/abs/2304.05919v1
βοΈ Dataset: https://paperswithcode.com/dataset/ade20k
https://t.iss.one/DataScienceT
π3β€βπ₯2
πSEEM: Segment Everything Everywhere All at Once
Universal, interactive multi-modal interface for any types of segmentation with ONE SINGLE MODE.
π₯ Github: https://github.com/ux-decoder/segment-everything-everywhere-all-at-once
β© Paper: https://arxiv.org/abs/2304.06718v1
βοΈ Dataset: https://paperswithcode.com/dataset/refcoco
https://t.iss.one/DataScienceT
Universal, interactive multi-modal interface for any types of segmentation with ONE SINGLE MODE.
π₯ Github: https://github.com/ux-decoder/segment-everything-everywhere-all-at-once
β© Paper: https://arxiv.org/abs/2304.06718v1
βοΈ Dataset: https://paperswithcode.com/dataset/refcoco
https://t.iss.one/DataScienceT
β€βπ₯2
This media is not supported in your browser
VIEW IN TELEGRAM
π¨ Animated Drawings
A Method for Automatically Animating Children's Drawings of the Human Figure
π₯ Github: https://github.com/facebookresearch/AnimatedDrawings
βοΈProject: https://fairanimateddrawings.com/site/home
β© Paper: arxiv.org/pdf/2303.12741.pdf
https://t.iss.one/DataScienceT
A Method for Automatically Animating Children's Drawings of the Human Figure
π₯ Github: https://github.com/facebookresearch/AnimatedDrawings
βοΈProject: https://fairanimateddrawings.com/site/home
β© Paper: arxiv.org/pdf/2303.12741.pdf
https://t.iss.one/DataScienceT
β€βπ₯3π2
ββInceptionNeXt: When Inception Meets ConvNeXt
Large-kernel convolutions, such as those employed in ConvNeXt, can improve model performance but often come at the cost of efficiency due to high memory access costs. Although reducing kernel size may increase speed, it often leads to significant performance degradation.
To address this issue, the authors propose InceptionNeXt, which decomposes large-kernel depthwise convolution into four parallel branches along the channel dimension. This new Inception depthwise convolution results in networks with high throughputs and competitive performance. For example, InceptionNeXt-T achieves 1.6x higher training throughputs than ConvNeX-T and a 0.2% top-1 accuracy improvement on ImageNet-1K. InceptionNeXt has the potential to serve as an economical baseline for future architecture design, helping to reduce carbon footprint.
A detailed unofficial overview of the paper: https://andlukyane.com/blog/paper-review-inceptionnext
Paper link:https://arxiv.org/abs/2303.16900
Code link: https://github.com/sail-sg/inceptionnext
https://t.iss.one/DataScienceT
Large-kernel convolutions, such as those employed in ConvNeXt, can improve model performance but often come at the cost of efficiency due to high memory access costs. Although reducing kernel size may increase speed, it often leads to significant performance degradation.
To address this issue, the authors propose InceptionNeXt, which decomposes large-kernel depthwise convolution into four parallel branches along the channel dimension. This new Inception depthwise convolution results in networks with high throughputs and competitive performance. For example, InceptionNeXt-T achieves 1.6x higher training throughputs than ConvNeX-T and a 0.2% top-1 accuracy improvement on ImageNet-1K. InceptionNeXt has the potential to serve as an economical baseline for future architecture design, helping to reduce carbon footprint.
A detailed unofficial overview of the paper: https://andlukyane.com/blog/paper-review-inceptionnext
Paper link:https://arxiv.org/abs/2303.16900
Code link: https://github.com/sail-sg/inceptionnext
https://t.iss.one/DataScienceT
artgor
Paper Review: InceptionNeXt: When Inception Meets ConvNeXt
My review of the paper InceptionNeXt When Inception Meets ConvNeXt
β€βπ₯5π2
π» Graph classification with Transformers
This notebook shows how to fine-tune the Graphormer model for Graph Classification on a dataset available on the hub.
π€Hugging face blog: https://huggingface.co/blog/graphml-classification
β© Intro to Graphs: https://t.iss.one/ai_machinelearning_big_data/3214
π₯ Github: https://github.com/huggingface/blog/blob/main/notebooks/graphml-classification.ipynb
β© Paper: https://arxiv.org/abs/2106.05234
βοΈDataset: https://ogb.stanford.edu/
https://t.iss.one/DataScienceT
This notebook shows how to fine-tune the Graphormer model for Graph Classification on a dataset available on the hub.
π€Hugging face blog: https://huggingface.co/blog/graphml-classification
β© Intro to Graphs: https://t.iss.one/ai_machinelearning_big_data/3214
π₯ Github: https://github.com/huggingface/blog/blob/main/notebooks/graphml-classification.ipynb
β© Paper: https://arxiv.org/abs/2106.05234
βοΈDataset: https://ogb.stanford.edu/
https://t.iss.one/DataScienceT
β€βπ₯2π2β€1
π An open, billion-scale corpus of images interleaved with text.
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
π₯ Github: https://github.com/allenai/mmc4
β© Paper: https://arxiv.org/abs/2304.06939v1
βοΈ Dataset: https://paperswithcode.com/dataset/c4
https://t.iss.one/DataScienceT
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
π₯ Github: https://github.com/allenai/mmc4
β© Paper: https://arxiv.org/abs/2304.06939v1
βοΈ Dataset: https://paperswithcode.com/dataset/c4
https://t.iss.one/DataScienceT
β€βπ₯3π1
STU-Net: Scalable and Transferable Medical Image Segmentation Models Empowered by Large-Scale Supervised Pre-training
π₯ Github: https://github.com/ziyan-huang/stu-net
β© Paper: https://arxiv.org/pdf/2304.06716v1.pdf
π¨ Dataset: https://paperswithcode.com/dataset/abdomenct-1k
https://t.iss.one/DataScienceT
π₯ Github: https://github.com/ziyan-huang/stu-net
β© Paper: https://arxiv.org/pdf/2304.06716v1.pdf
π¨ Dataset: https://paperswithcode.com/dataset/abdomenct-1k
https://t.iss.one/DataScienceT
β€βπ₯5
πΈ Omni Aggregation Networks for Lightweight Image Super-Resolution
Omni Self-attention paradigm for simultaneous spatial and channel interactions,mining all the potential correlations across omni-axis.
π₯ Github: https://github.com/francis0625/omni-sr
β© Paper: https://arxiv.org/abs/2304.10244v1
βοΈ Dataset: https://paperswithcode.com/dataset/manga109
https://t.iss.one/DataScienceT
Omni Self-attention paradigm for simultaneous spatial and channel interactions,mining all the potential correlations across omni-axis.
π₯ Github: https://github.com/francis0625/omni-sr
β© Paper: https://arxiv.org/abs/2304.10244v1
βοΈ Dataset: https://paperswithcode.com/dataset/manga109
https://t.iss.one/DataScienceT