So you’d basically install Ollama, download one of the GGUF builds of this model off HuggingFace, and create a Modelfile, since this model isn’t in the default Ollama registry; then Ollama can answer prompts with it. Modelfiles are very simple and are modeled on Dockerfiles. It takes like 15 seconds to write one if you aren’t messing with the various parameters.
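A minimal Modelfile sketch, assuming you've already downloaded a GGUF build (the filename here is illustrative; adjust it to whatever file you actually grabbed):

```
# Modelfile: point FROM at the GGUF you downloaded (filename is illustrative)
FROM ./granite-8b-code-instruct.Q4_K_M.gguf

# optional tuning; the defaults are usually fine
PARAMETER temperature 0.2
```

Then register and run it with `ollama create granite-code -f Modelfile` and `ollama run granite-code` (the name "granite-code" is just whatever you want to call it locally).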
Once it’s in Ollama, just get one of the various GPT plugins for VSCode and point it at the Ollama URL (http://localhost:11434 by default). I use continue.dev, but there are many.
Continue replaces tab autocomplete with LLM completions and adds a chat panel on the right, where keyboard shortcuts let you copy code into the prompt and ask it to edit or generate code, or answer questions about existing code.
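For reference, hooking Continue up to a local Ollama model is roughly this in its config.json (field names and the model name are from my setup and may differ by version; check their docs):

```json
{
  "models": [
    {
      "title": "Granite 8B (local)",
      "provider": "ollama",
      "model": "granite-code",
      "apiBase": "http://localhost:11434"
    }
  ],
  "tabAutocompleteModel": {
    "title": "Granite autocomplete",
    "provider": "ollama",
    "model": "granite-code"
  }
}
```

Here "granite-code" is whatever name you gave the model when you created it in Ollama.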
the server is here: https://github.com/ggerganov/llama.cpp/tree/master/examples/...
And you can search for any GGUF on HuggingFace.
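If you'd rather skip Ollama entirely, that's just a couple of commands (the repo and filenames below are illustrative, and the server binary has been called `./server` or `./llama-server` depending on the llama.cpp version):

```shell
# grab a GGUF from HuggingFace (repo/filename are illustrative)
huggingface-cli download TheBloke/SomeModel-GGUF somemodel.Q4_K_M.gguf --local-dir .

# serve it over HTTP on port 8080
./llama-server -m somemodel.Q4_K_M.gguf -c 4096 --port 8080
```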
Where would I start if I wanted to use a model programmatically? Say I'm building a chatbot: I have a large dataset of replies I want the model to mimic, and I'd want to do this in Python. Of course, I'd probably use a different model than Granite.
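One simple starting point, assuming you run the model locally through Ollama: hit its HTTP chat API from Python and prime the model with a handful of your example replies in the system prompt. This is a few-shot sketch (the model name is an assumption, and mimicking a genuinely large dataset would point toward fine-tuning instead), but it's enough to get a loop going:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/chat"  # Ollama's default chat endpoint


def build_payload(example_replies, user_message, model="llama3"):
    """Build a /api/chat request that primes the model with example
    replies so it imitates their tone (few-shot prompting)."""
    system = ("Reply in the same tone and style as these examples:\n"
              + "\n".join(f"- {r}" for r in example_replies))
    return {
        "model": model,     # whatever model you've pulled into Ollama
        "stream": False,    # one JSON object back instead of a stream
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": user_message},
        ],
    }


def chat(example_replies, user_message, model="llama3"):
    """POST the payload to Ollama and return the assistant's reply text."""
    data = json.dumps(build_payload(example_replies, user_message, model)).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["message"]["content"]


# usage (needs a running Ollama instance):
#   chat(["no worries!", "happy to help :)"], "My order never arrived.")
```

The payload shape matches Ollama's /api/chat endpoint; with `"stream": False` you get a single JSON response whose reply text sits under `message.content`.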
> Our process to prepare code pretraining data involves several stages. First, we collect a combination of publicly available datasets (e.g., GitHub Code Clean, Starcoder data), public code repositories, and issues from GitHub
Citation needed
All I've seen from them in my professional experience is legacy mainframe maintenance. Not shovelware, but very far from hardcore tech.
They've been doing "AI" for ages, notably Watson over the last couple of decades or so.
I've not seen any proper evaluations for Granite against, say, Llama or Mistral.
Until we do it's probably too early to say they can't compete, at least in some areas where others perform poorly.
Previous Granite models were on the level of the first LLaMA in my benchmarks.
I'm expecting this version to be roughly comparable to Llama 2.
Did you even read the benchmarks they post at that link? Assuming they're not outright lying, their 8B model beats Llama/Mistral models of the same size on coding tasks.