ycmd: a code-completion & comprehension server
ycmd is a server that provides APIs for code-completion and other code-comprehension use-cases like semantic GoTo commands (and others). For certain filetypes, ycmd can also provide diagnostic errors and warnings.
ycmd was originally part of YouCompleteMe's codebase, but has been split out into a separate project so that it can be used in editors other than Vim.
Check the API documentation if you want to implement a client. A
good way to learn how to interact with ycmd is by reading through (and running)
the example_client.py
file. See the README for the
examples folder for details on how to run the example client.
Known ycmd clients:
- YouCompleteMe: Vim client, stable and exposes all ycmd features.
- emacs-ycmd: Emacs client.
- you-complete-me: Atom client.
- YcmdCompletion: Sublime client
- sublime-ycmd: Sublime Text 3 client.
- kak-ycmd: Kakoune client.
- you-complete-me: VSCode client.
- gycm: Geany client.
- nano-ycmd: GNU nano client.
Feel free to send a pull request adding a link to your client here if you've built one.
Building
If you're looking to develop ycmd, see the instructions for running the tests.
This is all for Ubuntu Linux. Details on getting ycmd running on other OS's can be found in YCM's instructions (ignore the Vim-specific parts). Note that ycmd runs on Python 3.8.0+.
First, install the minimal dependencies:
sudo apt install build-essential cmake python3-dev
Next, install the language specific dependencies you need:
sudo apt install golang-go
for Go.sudo apt install npm
for JavaScript and TypeScript.sudo apt install mono-devel
for C#.sudo apt install openjdk-8-jre
for Java.
When you first clone the repository you'll need to update the submodules:
git submodule update --init --recursive
Then run python3 build.py --all
or any of the specific completers listed by
python3 build.py --help
. This should get you going.
For more detailed instructions on building ycmd, see YCM's instructions (ignore the Vim-specific parts).
Supported compilers
- GCC 8 and later
- Clang 7 and later
- Microsoft Visual Studio 2017 v 15.7 and later
API notes
- All strings going into and out of the server are UTF-8 encoded.
- All lines end with
\n
. - All line and column numbers are 1-based, not 0-based. They are also byte offsets, not Unicode codepoint offsets.
- All file paths are full, absolute paths.
- All requests to the server must include an HMAC in the
x-ycm-hmac
HTTP header. The HMAC is computed from the shared secret passed to the server on startup and the request/response body. The digest algorithm is SHA-256. The server will also include the HMAC in its responses; you must verify it before using the response. Seeexample_client.py
to see how it's done. - API is documented in swagger and published on the website.
How ycmd works
There are several completion engines in ycmd. The most basic one is an identifier-based completer that collects all of the identifiers in the file provided in the completion request, other files of the same filetype that were provided previously and any tags files produced by ctags. This engine is non-semantic.
There are also several semantic engines in YCM. There's clangd-based completer that both provide semantic completion for C-family languages. There's also a Jedi-based completer for semantic completion for Python, an OmniSharp-based completer for C#, a gopls-based completer for Go (using gopls for jumping to definitions), a TSServer-based completer for JavaScript and TypeScript, a jdt.ls-based server for Java, and a RLS-based completer for Rust. More will be added with time.
There are also other completion engines, like the filepath completer (part of the identifier completer).
The server will automatically detect which completion engine would be the best in any situation. On occasion, it queries several of them at once, merges the outputs and presents the results.
Semantic engines are triggered only after semantic "triggers" are inserted in
the code. If the request received shows that the user's cursor is after the last
character in string foo; foo.
in a C# file, this would trigger the semantic
engine to
examine members of foo
because .
is a default semantic
trigger for C# (triggers can be changed dynamically). If the
text were string foo; foo.zoo
, semantic completion would still be triggered
(the trigger is behind the zoo
word the user is typing) and the results would
be filtered with the zoo
query.
Semantic completion can also be forced by setting force_semantic: true
in
the JSON data for the completion request, but you should only do this if the
user explicitly requested semantic completion with a keyboard shortcut;
otherwise, leave it up to ycmd to decide when to use which engine.
The reason why semantic completion isn't always used even when available is because the semantic engines can be slow and because most of the time, the user doesn't actually need semantic completion.
There are two main use-cases for code-completion:
- The user knows which name they're looking for, they just don't want to type the whole name.
- The user either doesn't know the name they need or isn't sure what the name is. This is also known as the "API exploration" use-case.
The first use case is the most common one and is trivially addressed with the identifier completion engine (which BTW is blazing fast). The second one needs semantic completion.
Completion string filtering
A critical thing to note is that the completion filtering is NOT based on
the input being a string prefix of the completion (but that works too). The
input needs to be a subsequence match of a completion. This is a fancy way
of saying that any input characters need to be present in a completion string in
the order in which they appear in the input. So abc
is a subsequence of
xaybgc
, but not of xbyxaxxc
.
Completion string ranking
The subsequence filter removes any completions that do not match the input, but then the sorting system kicks in. It's a bit involved, but roughly speaking "word boundary" (WB) subsequence character matches are "worth" more than non-WB matches. In effect, this means given an input of "gua", the completion "getUserAccount" would be ranked higher in the list than the "Fooguxa" completion (both of which are subsequence matches). A word-boundary character are all capital characters, characters preceded by an underscore and the first letter character in the completion string.
Auto-shutdown if no requests for a while
If the server hasn't received any requests for a while (controlled by the
--idle_suicide_seconds
ycmd flag), it will shut itself down. This is useful
for cases where the process that started ycmd dies without telling ycmd to die
too or if ycmd hangs (this should be extremely rare).
If you're implementing a client for ycmd, ensure that you have some sort of
keep-alive background thread that periodically pings ycmd (just call the
/healthy
handler, although any handler will do).
You can also turn this off by passing --idle_suicide_seconds=0
, although that
isn't recommended.
Exit codes
During startup, ycmd attempts to load the ycm_core
library and exits with one
of the following return codes if unsuccessful:
- 3: unexpected error while loading the library;
- 4: the
ycm_core
library is missing; - 7: the version of the
ycm_core
library is outdated. - 8: server is started with python; recompile with python3.
User-level customization
You can provide settings to ycmd on server startup. There's a
default_settings.json
file that you can tweak. See the
Options section in YCM's User Guide for a description on what
each option does. Pass the path to the modified settings file to ycmd as an
--options_file=/path/to/file
flag. Note that you must set the hmac_secret
setting (encode the value with base64). Because the file you are passing
contains a secret token, ensure that you are creating the temporary file in a
secure way (the mkstemp()
Linux system call is a good idea; use
something similar for other OS's).
After it starts up, ycmd will delete the settings file you provided after it reads it.
The settings file is something your editor should produce based on values your
user has configured. There's also an extra file (.ycm_extra_conf.py
) your user
is supposed to provide to configure certain semantic completers. More
information on it can also be found in the corresponding section of YCM's User
Guide.
.ycm_extra_conf.py
specification
The .ycm_extra_conf.py
module may define the following functions:
Settings( **kwargs )
This function allows users to configure the language completers on a per project
basis or globally. Currently, it is required by the libclang-based completer and
optional for other completers. The following arguments can be retrieved from
the kwargs
dictionary and are common to all completers:
-
language
: an identifier of the completer that called the function. Its value ispython
for the Python completer andcfamily
for the C-family completer. This argument is useful to configure several completers at once. For instance:def Settings( **kwargs ): language = kwargs[ 'language' ] if language == 'cfamily': return { # Settings for the libclang and clangd-based completer. } if language == 'python': return { # Settings for the Python completer. } return {}
-
filename
: absolute path of the file currently edited. -
client_data
: any additional data supplied by the client application. See the YouCompleteMe documentation for an example.
The return value is a dictionary whose content depends on the completer.
LSP based completers
LSP servers often support user configuration via the initialise request. These
are usually presented as options in the UI. Ycmd supports this using the
.ycm_extra_conf.py
by allowing the user to specify the exact dictionary of
settings that are passed in the server initialise message. These options are
returned from Settings
under the ls
key. The python dictionary is converted
to json and included verbatim in the LSP initialize request. In order to
determine the set of options for a server, consult the server's documentation or
package.json
file.
A config_sections
object is a dictionary whose keys are "sections" and values
are pieces of settings (usually found in the ls
object) corresponding to
those sections. This is even more underspecified and requires trial and error
to make it work. It is optional and only useful if you explicitly enable
workspace/configuration
support.
Example of LSP configuration:
def Settings( **kwargs ):
if kwargs[ 'language' ] == 'java':
return { 'ls': { 'java.rename.enabled' : False },
# `config_sections` is not used for java...
'config_sections': { 'section0': {} }
In addition, ycmd can use any language server, given a file type and a command
line. A user option language_server
can be used to plug in a LSP server ycmd
wouldn't usually know about. The value is a list of dictionaries containing:
name
: the string representing the name of the servercmdline
: the list representing the command line to execute the server (optional; mandatory if port not specified)port
: optional. If specified, a TCP connection is used to this port. If set to*
, an unused locall port is selected and made availble in thecmdline
as${port}
(see below examples).filetypes
: list of supported filetypes.project_root_files
: Tells ycmd which files indicate project root.capabilities'
: Overrides the default LSP capabilities of ycmd.- If you enable
workspace/configuration
support, check the extra conf details, relevant to LSP servers.
- If you enable
triggerCharacters
: Override the LSP server's trigger characters for completion. This can be useful when the server obnoxiously requests completion on every character or for example on whitespace characters.
{
"language_server": [ {
"name": "gopls",
"cmdline": [ "/path/to/gopls", "-rpc.trace" ],
"filetypes": [ "go" ],
"project_root_files": [ "go.mod" ],
"triggerCharacters": [ "." ]
} ]
}
Or, to use a TCP connection:
{
"language_server": [ {
"name": "godot",
"port": "6008",
"filetypes": [ "gdscript" ]
} ]
}
Or, to use an unused local port, set port
to *
and use ${port}
in the
cmdline
:
{
"language_server": [ {
"name": "someserver",
"cmdline": [ "/path/to/some/server", "--port", "${port}" ],
"port": "*",
"filetypes": [ "somethign" ],
"project_root_files": [ "somethingfile" ]
} ]
}
When plugging in a completer in this way, the kwargs[ 'language' ]
will be set
to the value of the name
key, i.e. gopls
in the above example.
A number of LSP completers are currently supported without language_server
,
usch as:
- Java
- Rust
- Go
- C-family
One can also override the root directory, with project_directory
.
def Settings( **kwargs ):
return { 'project_directory': 'src/' } # The path may be absolute as well.
Note: If an LSP based completer is configured for a language that's supported "built-in", it overrides the built-in support.
C-family settings
The Settings
function is called by the libclang and clangd-based completers to
get the compiler flags to use when compiling the current file. The absolute path
of this file is accessible under the filename
key of the kwargs
dictionary.
The return value expected by both completers is a dictionary containing the following items:
-
flags
: (mandatory for libclang, optional for clangd) a list of compiler flags. -
include_paths_relative_to_dir
: (optional) the directory to which the include paths in the list of flags are relative. Defaults to ycmd working directory for the libclang completer and.ycm_extra_conf.py
's directory for the clangd completer. -
do_cache
: (optional) a boolean indicating whether or not the result of this call (i.e. the list of flags) should be cached for this file name. Defaults toTrue
. If unsure, the default is almost always correct.
The libclang-based completer also supports the following items:
-
override_filename
: (optional) a string indicating the name of the file to parse as the translation unit for the supplied file name. This fairly advanced feature allows for projects that use a 'unity'-style build, or for header files which depend on other includes in other files. -
flags_ready
: (optional) a boolean indicating that the flags should be used. Defaults toTrue
. If unsure, the default is almost always correct.
A minimal example which simply returns a list of flags is:
def Settings( **kwargs ):
return {
'flags': [ '-x', 'c++' ]
}
Formatting settings
The configuration for Format
subcommand can be specified with an extra conf for
the java subserver and for the typescript subserver.
The formatter options can be found below:
These servers support custom formatting options to be supplied in a different
way than the rest. For this purpose the Settings
function can return a
formatter
property.
An example of the formatter configuration would be:
def Settings( **kwargs ):
return {
'formatting_options': {
'org.eclipse.jdt.core.formatter.lineSplit': 30,
}
}
Python settings
The Settings
function allows users to specify the Python interpreter and
the sys.path
used by the completer to provide completion and code
comprehension. No additional arguments are passed.
The return value expected by the completer is a dictionary containing the following items:
-
interpreter_path
: (optional) path to the Python interpreter.~
and environment variables in the path are expanded. If not an absolute path, it will be searched through thePATH
. -
sys_path
: (optional) list of paths prepended tosys.path
.
Usage example:
def Settings( **kwargs ):
return {
'interpreter_path': '~/project/virtual_env/bin/python',
'sys_path': [ '~/project/third_party/module' ]
}
PythonSysPath( **kwargs )
Optional for Python support.
This function allows further customization of the Python path sys.path
. Its
parameters are the possible items returned by the Settings
function for the
Python completer:
-
interpreter_path
: path to the Python interpreter. -
sys_path
: list of Python paths fromsys.path
.
The return value should be the modified list of Python paths.
See ycmd's own .ycm_extra_conf.py
for an example.
Global extra conf file specification
The global extra module must expose the same functions as the
.ycm_extra_conf.py
module with the following additions:
YcmCorePreLoad()
Optional.
This method, if defined, is called by the server prior to importing the c++ python plugin. It is not usually required and its use is for advanced users only.
Shutdown()
Optional.
Called prior to the server exiting cleanly. It is not usually required and its use is for advanced users only.
Backwards compatibility
ycmd's HTTP+JSON interface follows SemVer. While ycmd has seen extensive use over the last several years as part of YCM, the version number is below 1.0 because some parts of the API might change slightly as people discover possible problems integrating ycmd with other editors. In other words, the current API might unintentionally be Vim-specific. We don't want that.
Note that ycmd's internal API's (i.e. anything other than HTTP+JSON) are NOT covered by SemVer and will randomly change underneath you. DON'T interact with the Python/C++/etc code directly!
FAQ
Is HMAC auth for requests/responses really necessary?
Without the HMAC auth, it's possible for a malicious website to impersonate the user. Don't forget that evil.com can send requests to servers listening on localhost if the user visits evil.com in a browser.
This is not merely a theoretical concern; a working proof-of-concept remote code execution exploit was created for ycmd running on localhost. The HMAC auth was added to block this attack vector.
Contributor Code of Conduct
Please note that this project is released with a Contributor Code of Conduct. By participating in this project you agree to abide by its terms.
Contact
If you have questions about the plugin or need help, please use the ycmd-users mailing list.
The author's homepage is http://val.markovic.io.
License
This software is licensed under the GPL v3 license. © 2015-2019 ycmd contributors