Data Science | Machine Learning with Python for Researchers
31.5K subscribers
1.59K photos
102 videos
22 files
1.87K links
Admin: @HusseinSheikho

The Data Science and Python channel is for researchers and advanced programmers

Buy ads: https://telega.io/c/dataScienceT
Download Telegram
A list of the best Telegram channels related to data science, programming languages, and artificial intelligence.

Join Quickly:
https://t.iss.one/addlist/8_rRW2scgfRhOTc0
❤‍🔥3
🏔️ Large Language Model for Geoscience

We introduce K2 (7B), an open-source language model trained by firstly further pretraining LLaMA on collected and cleaned geoscience literature, including geoscience open-access papers and Wikipedia pages, and secondly fine-tuning with knowledge-intensive instruction tuning data (GeoSignal).

git clone https://github.com/davendw49/k2.git
cd k2
conda env create -f k2.yml
conda activate k2


🖥 Github: https://github.com/davendw49/k2

⭐️ Demo: https://huggingface.co/daven3/k2_fp_delta

📕 Paper: https://arxiv.org/abs/2306.05064v1

🔗 Dataset: https://huggingface.co/datasets/daven3/geosignal

https://t.iss.one/DataScienceT
❤‍🔥4👍2
💲 FinGPT: Open-Source Financial Large Language Models

Unlike proprietary models, FinGPT takes a data-centric approach, providing researchers and practitioners with accessible and transparent resources to develop their FinLLMs.

🖥 Github: https://github.com/ai4finance-foundation/fingpt

⭐️ FinNLP: https://github.com/ai4finance-foundation/finnlp

📕 Paper: https://arxiv.org/abs/2306.06031v1

🔗 Project: https://ai4finance-foundation.github.io/FinNLP/

https://t.iss.one/DataScienceT
❤‍🔥4👍41
You can now download and watch all paid data science courses for free by subscribing to our new channel

https://t.iss.one/udemy13
👍2❤‍🔥1
Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement
at 100k Steps-Per-Second

🖥 Github: https://github.com/facebookresearch/galactic

Paper: https://arxiv.org/pdf/2306.07552v1.pdf

💨 Dataset: https://paperswithcode.com/dataset/vizdoom

https://t.iss.one/DataScienceT
❤‍🔥41
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration

Macaw-LLM is a model of its kind, bringing together state-of-the-art models for processing visual, auditory, and textual information, namely CLIP, Whisper, and LLaMA.

🖥 Github: https://github.com/lyuchenyang/macaw-llm

⭐️ Model: https://tinyurl.com/yem9m4nf

📕 Paper: https://tinyurl.com/4rsexudv

🔗 Dataset: https://github.com/lyuchenyang/Macaw-LLM/blob/main/data

https://t.iss.one/DataScienceT
👍42❤‍🔥1
Semi-supervised learning made simple with self-supervised clustering [CVPR 2023]

🖥 Github: https://github.com/pietroastolfi/suave-daino

Paper: https://arxiv.org/pdf/2306.07483v1.pdf

💨 Dataset: https://paperswithcode.com/dataset/imagenet

https://t.iss.one/DataScienceT
❤‍🔥21👍1
How do Transformers work?

All
the Transformer models mentioned above (GPT, BERT, BART, T5, etc.) have been trained as language models. This means they have been trained on large amounts of raw text in a self-supervised fashion. Self-supervised learning is a type of training in which the objective is automatically computed from the inputs of the model. That means that humans are not needed to label the data!

This type of model develops a statistical understanding of the language it has been trained on, but it’s not very useful for specific practical tasks. Because of this, the general pretrained model then goes through a process called transfer learning. During this process, the model is fine-tuned in a supervised way — that is, using human-annotated labels — on a given task

🔗 Read More

🌸 https://t.iss.one/DataScienceT
👍32❤‍🔥2
Data Science With Python Workflow Cheat Sheet

Creator: business Science
Stars ⭐️: 75
Forked By: 38

https://github.com/business-science/cheatsheets/blob/master/Data_Science_With_Python_Workflow.pdf

https://t.iss.one/DataScienceT
👍53
80+ Jupyter Notebook tutorials on image classification, object detection and image segmentation in various domains
📌 Agriculture and Food
📌 Medical and Healthcare
📌 Satellite
📌 Security and Surveillance
📌 ADAS and Self Driving Cars
📌 Retail and E-Commerce
📌 Wildlife

Classification library
https://github.com/Tessellate-Imaging/monk_v1

Notebooks - https://github.com/Tessellate-Imaging/monk_v1/tree/master/study_roadmaps/4_image_classification_zoo

Detection and Segmentation Library
https://github.com/Tessellate-Imaging/

Monk_Object_Detection
Notebooks: https://github.com/Tessellate-Imaging/Monk_Object_Detection/tree/master/application_model_zoo

https://t.iss.one/DataScienceT
👍7❤‍🔥3