ml4se

BLOOM is an autoregressive Large Language Model (LLM), trained to continue text from a prompt on vast amounts of text data using industrial-scale computational resources. As such, it is able to output coherent text in 46 languages and 13 programming languages that is hardly distinguishable from text written by humans. BLOOM can also be instructed to perform text tasks it hasn't been explicitly trained for, by casting them as text generation tasks.
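As a rough illustration of casting a task as text generation, the sketch below wraps an input in a template whose natural continuation is the answer. The function name, task keys, and template wording are all hypothetical and not taken from BLOOM's documentation. No model is called here; the resulting string is what one would pass to a text-generation model.

```python
def as_generation_prompt(task: str, text: str) -> str:
    """Wrap an input in a template so that the natural continuation is the answer."""
    # Hypothetical templates; real prompts would need tuning per model and task.
    templates = {
        "translate_fr": "Translate English to French.\nEnglish: {text}\nFrench:",
        "summarize": "Article: {text}\nOne-sentence summary:",
        "sentiment": "Review: {text}\nSentiment (positive or negative):",
    }
    return templates[task].format(text=text)


prompt = as_generation_prompt("sentiment", "The plot was thin but the acting was superb.")
print(prompt)
```

The model is never told it is doing classification; it only continues the text, and the template makes the label the most plausible continuation.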

ml4se

MLGOPerf: An ML Guided Inliner to Optimize Performance (Huawei)

The authors present MLGOPerf as the first end-to-end framework capable of optimizing performance using LLVM’s ML-Inliner.

The experimental results show that MLGOPerf gains up to 1.8% and 2.2% over LLVM’s O3 optimization when trained for performance on the SPEC CPU2006 and Cbench benchmarks, respectively. The approach also provides up to 26% more opportunities to autotune code regions on these benchmarks, which can translate into an additional 3.7% speedup.

ml4se

CodeT: Code Generation with Generated Tests (Microsoft)

The work explores the use of pre-trained language models to automatically generate test cases. The method is called CodeT (Code generation with generated Tests). CodeT executes the candidate code solutions on the generated test cases and then chooses the best solution based on a dual execution agreement: agreement with the generated test cases and agreement with the other generated solutions.
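The selection step can be sketched as follows. This is a simplified illustration, not the authors' implementation: candidates are grouped by the exact set of generated tests they pass, and each group is scored by (number of agreeing solutions) × (number of tests passed), which is an assumed reading of the dual agreement idea. The function names, candidate lambdas, and test data are hypothetical.

```python
from collections import defaultdict
from typing import Callable, Dict, FrozenSet, List, Tuple

Solution = Callable[[int], int]
TestCase = Tuple[int, int]  # (input, expected output)


def passes(solution: Solution, test: TestCase) -> bool:
    """Run one candidate on one generated test; any exception counts as a failure."""
    arg, expected = test
    try:
        return solution(arg) == expected
    except Exception:
        return False


def pick_by_dual_agreement(solutions: List[Solution], tests: List[TestCase]) -> Solution:
    # Group candidates by the exact set of test indices they pass.
    groups: Dict[FrozenSet[int], List[Solution]] = defaultdict(list)
    for sol in solutions:
        passed = frozenset(i for i, t in enumerate(tests) if passes(sol, t))
        groups[passed].append(sol)
    # Assumed scoring rule: agreeing solutions times tests passed.
    _, best_group = max(groups.items(), key=lambda kv: len(kv[1]) * len(kv[0]))
    return best_group[0]


# Hypothetical candidates for the task "return the square of x".
candidates = [lambda x: x * x, lambda x: x ** 2, lambda x: x + x, lambda x: 1 // (x - 3)]
# Hypothetical generated tests; the last one is deliberately wrong.
generated_tests = [(0, 0), (2, 4), (3, 9), (5, 10)]

chosen = pick_by_dual_agreement(candidates, generated_tests)
print(chosen(7))
```

Here the two correct candidates pass the same three tests and outscore the candidate that happens to satisfy the faulty test, so a wrong generated test does not decide the outcome on its own.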

ml4se

ESEC/FSE 2023
https://conf.researchr.org/home/fse-2023
Sat 11 - Fri 17 November 2023 San Francisco, California, United States

Research Papers deadlines:
- Thu 26 Jan 2023: Paper registration
- Thu 2 Feb 2023: Full paper submission
- Thu 4 May 2023: Initial notification
- Thu 29 Jun 2023: Revised manuscript submissions (major revisions only)
- Thu 27 Jul 2023: Final notification for major revisions
- Thu 24 Aug 2023: Camera ready

The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) is an internationally renowned forum for researchers, practitioners, and educators to present and discuss the most recent innovations…

ml4se

Amazon CodeWhisperer is a machine learning (ML)–powered service that helps improve developer productivity by generating code recommendations based on their comments in natural language and code in the integrated development environment (IDE).

ml4se

List of code generation models

ml4se

Two surveys on Text-to-SQL covering datasets, algorithms, and metrics:
- Deep Learning Driven Natural Languages Text to SQL Query Conversion: A Survey
- Recent Advances in Text-to-SQL: A Survey of What We Have and What We Expect

ml4se

Top Programming Languages 2022

ml4se

So Much in So Little: Creating Lightweight Embeddings of Python Libraries (JetBrains, Huawei)

- python library embeddings
- a prototype tool for suggesting relevant libraries to a given project

ml4se

The Universal Approximation Theorem for neural networks

ml4se

Hardin and Taylor proved that any function of the real numbers can be correctly predicted based solely on its past behavior at almost any point in time. However, these results do not provide a practical means of predicting the future. For example, the definition of the μ-strategy requires the Axiom of Choice. One might wonder if a different approach could yield similar results without using the Axiom of Choice. In nontrivial cases, the answer is no.

A peculiar connection between the axiom of choice and predicting the future

ml4se

Category Theory for AI

Program
Week 1: Why Category Theory?
Week 2: Essential building blocks: Categories and Functors
Week 3: Categorical Dataflow: Optics and Lenses as data structures for backpropagation
Week 4: Geometric Deep Learning & Naturality
Week 5: Monoids, Monads, Mappings, and lstMs

cats.for.ai

Categories for Machine Learning

This seminar series seeks to promote the learning and use of Category Theory by Machine Learning Researchers.

ml4se

No More Fine-Tuning? An Experimental Evaluation of Prompt Tuning in Code Intelligence

The authors compare fine-tuning and prompt tuning on several code understanding tasks: defect prediction, code summarization, and code translation. They conclude that prompt tuning is more effective than fine-tuning on these code intelligence tasks across different pre-trained models and programming languages. The advantage is more pronounced for smaller pre-trained models and in low-resource scenarios: the fewer training instances available, the larger the improvement from prompt tuning. Prompt tuning also performs better on the cross-domain code intelligence task.
