ml4se

Research talk: Cloud Intelligence/AIOps – Infusing AI into cloud computing (Microsoft)

51 views10:58

Fixing Dockerfile Smells: An Empirical Study

RQ1: How do developers fix Dockerfile smells?
RQ2: Which Dockerfile smells are developers willing to address?

101 views14:16

ml4se

DALL·E API Now Available in Public Beta

Developers can now integrate DALL·E directly into their apps and products through API.

Openai

DALL·E API now available in public beta

Starting today, developers can begin building apps with the DALL·E API.

50 views17:13

ml4se

Microsoft sued for open-source piracy through GitHub Copilot

Programmer and lawyer Matthew Butterick has sued Microsoft, GitHub, and OpenAI, alleging that GitHub's Copilot violates the terms of open-source licenses and infringes the rights of programmers.

Apart from the license violations, Butterick also alleges that the development feature violates the following:
- GitHub's terms of service and privacy policies,
- DMCA 1202, which forbids the removal of copyright-management information,
- the California Consumer Privacy Act,
- and other laws giving rise to the related legal claims.

The complaint was submitted to the U.S. District Court of the Northern District of California, demanding the approval of statutory damages of $9,000,000,000.

BleepingComputer

Microsoft sued for open-source piracy through GitHub Copilot

Programmer and lawyer Matthew Butterick has sued Microsoft, GitHub, and OpenAI, alleging that GitHub's Copilot violates the terms of open-source licenses and infringes the rights of code authors.

😁1

48 views06:50

ml4se

TOSS: Revisiting Code Search in a Two-Stage Paradigm (Microsoft)

The paper proposes a combination of two main DL-based approaches to code search — a fusion of bi-encoder and cross-encoder methods. The framework achieves state-of-the-art accuracy with an overall mean reciprocal ranking score of 0.763, compared to the best baseline result on the CodeSearchNet benchmark of 0.713.

49 views09:50

ml4se

μBERT: Mutation Testing using Pre-Trained Language Models

μBERT is a mutation testing tool. It exploits CodeBERT to generate mutants. The proposed approach is compared with PiTest on fault detection and assertion inference.

51 views11:46

ml4se

The Illustrated Stable Diffusion

A gentle introduction to how Stable Diffusion works.

58 views15:00

ml4se

Hey, Copilot!

Voice-based interaction with GitHub Copilot

The GitHub Blog

Everything new from GitHub Universe 2022

See what we're building to enhance the most integrated developer platform that allows developers and enterprises to drive innovation with ease.

111 views01:46

ml4se

TiCoder: Interactive Code Generation via Test-Driven User-IntentFormalization (Microsoft)

Test-driven user-intent formalization (or test-driven user-intent discovery): to create an interactive framework to (a) refine and formalize the user intent through generated tests, and (b) generate code that is consistent with such tests.

52 views10:32

ml4se

Time-Series Anomaly Detection with Implicit Neural Representation

Some ML4SE tasks are related to time series (anomaly detection in logs, forecasting in resource management, etc.). A novel method called Implicit Neural Representation-based Anomaly Detection (INRAD) is proposed. It uses error-based anomaly detection strategy. Using MLP, it learns to predict the value of a time series by a timestamp. The timestamp is the only input.

58 views15:18

ml4se

HyperTime: Implicit Neural Representation for Time Series

This architecture leverages INRs to learn a compressed latent representation of an entire time series dataset. The output of the HyperNet is a one-dimensional 7500-values embedding that contains the network weights of an INR (HypoNet) which encodes the time series data from the input.

62 views15:25

ml4se

Cloud Intelligence/AIOps – Infusing AI into Cloud Computing Systems (Microsoft)

AIOps is a rapidly emerging technology trend and an interdisciplinary research direction across system, software engineering, and AI/ML communities. With years of research on Cloud Intelligence, Microsoft Research has built up rich technology assets in detection, diagnosis, prediction, and optimization.

73 views14:02

ml4se

Scientists and government representatives meeting at a conference in France have voted to scrap leap seconds by 2035, the organisation responsible for global timekeeping has said.

In November 2022 at the 27th General Conference on Weights and Measures, held about every four years at the Versailles Palace, it was decided to abandon the leap second by or before 2035. From then the difference between atomic and astronomical time will be allowed to grow to a larger value yet to be determined.

the Guardian

Do not adjust your clock: scientists call time on the leap second

Second added periodically to synchronise atomic time and Earth time can cause problems for GPS systems, software and telecoms

72 views05:07

ml4se

CS598: Machine Learning for Software Engineering

- Code representation and embeddings
- Source code analysis
- Code summarization
- Test input generation
- Fuzz testing
- Oracle inference
- Fault localization
- Program (bug) repair
- Regression testing
- Security testing and vulnerability detection
- Code completion
- Clone detection

🔥2

217 views16:27

ml4se

Course: Machine Learning for Software Engineering (Ural State University)

- Introduction to machine learning
- Introduction to Transformer
- Code representation 1
- Code representation 2
- Code generation
- Code summarization
- Clone detection
- Code search 1
- Code search 2
- Code completion
- Vulnerabilities

GitHub

GitHub - konygin/course_ml4se: ML4SE course

ML4SE course. Contribute to konygin/course_ml4se development by creating an account on GitHub.

85 viewsedited 14:21

ml4se

Large Language Models Can Self-Improve

CoT + multiple path decoding + self-consistency = effective self-training

74.4%->82.1% on GSM8K
78.2%->83.0% on DROP
90.0%->94.4% on OpenBookQA
63.4%->67.9% on ANLI-A3

67 viewsedited 14:41

ml4se

Is effective self-training possible for small and medium-sized models?

Anonymous Poll

57%

Yes

43%

7 voters115 views14:45

ml4se

CodeQL code scanning launches Kotlin analysis support

Starting November 28, GitHub code scanning includes beta support for analyzing code written in Kotlin, powered by the CodeQL engine.

72 views06:12

ml4se

Advent of Code is an annual set of Christmas-themed computer programming challenges that follow an Advent calendar. It has been running since 2015. The programming puzzles cover a variety of skill sets and skill levels and can be solved using any programming language.

OpenAI Solved Part 1 in 10 Seconds
https://www.reddit.com/r/adventofcode/comments/zb942v/2022_day_03_first_place_for_part_1_today_10/

r/adventofcode on Reddit: [2022 Day 03] First place for part 1 today (10 seconds!) was fully automated using new OpenAI language…

Posted by u/rk-imn - No votes and 3 comments

166 views12:58

ml4se

Advent of Code
ChatGPT edition: https://github.com/ishan0102/aoc-2022-chatgpt

GitHub

GitHub - ishan0102/aoc-2022-chatgpt: ChatGPT's solutions to Advent-of-Code 2022

ChatGPT's solutions to Advent-of-Code 2022. Contribute to ishan0102/aoc-2022-chatgpt development by creating an account on GitHub.

64 views13:00

ml4se

PyTorch 2.0

Faster, more pythonic and as dynamic as ever

PyTorch