Class: Linguist::Language

Inherits:
Object
  • Object
show all
Defined in:
lib/linguist/language.rb

Overview

Language names that are recognizable by GitHub. Defined languages can be highlighted, searched and listed under the Top Languages page.

Languages are defined in ‘lib/linguist/languages.yml`.

Constant Summary collapse

TYPES =

Valid Languages types

[:data, :markup, :programming, :prose]
STRATEGIES =
[
  Linguist::Strategy::Modeline,
  Linguist::Shebang,
  Linguist::Strategy::Filename,
  Linguist::Heuristics,
  Linguist::Classifier
]

Instance Attribute Summary collapse

Class Method Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(attributes = {}) ⇒ Language

Internal: Initialize a new Language

attributes - A hash of attributes



279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
# File 'lib/linguist/language.rb', line 279

def initialize(attributes = {})
  # @name is required
  @name = attributes[:name] || raise(ArgumentError, "missing name")

  # Set type
  @type = attributes[:type] ? attributes[:type].to_sym : nil
  if @type && !TYPES.include?(@type)
    raise ArgumentError, "invalid type: #{@type}"
  end

  @color = attributes[:color]

  # Set aliases
  @aliases = [default_alias_name] + (attributes[:aliases] || [])

  # Load the TextMate scope name or try to guess one
  @tm_scope = attributes[:tm_scope] || begin
    context = case @type
              when :data, :markup, :prose
                'text'
              when :programming, nil
                'source'
              end
    "#{context}.#{@name.downcase}"
  end

  @ace_mode = attributes[:ace_mode]
  @wrap = attributes[:wrap] || false

  # Set legacy search term
  @search_term = attributes[:search_term] || default_alias_name

  # Set extensions or default to [].
  @extensions = attributes[:extensions] || []
  @interpreters = attributes[:interpreters]   || []
  @filenames  = attributes[:filenames]  || []

  # Set popular, and searchable flags
  @popular    = attributes.key?(:popular)    ? attributes[:popular]    : false
  @searchable = attributes.key?(:searchable) ? attributes[:searchable] : true

  # If group name is set, save the name so we can lazy load it later
  if attributes[:group_name]
    @group = nil
    @group_name = attributes[:group_name]

  # Otherwise we can set it to self now
  else
    @group = self
  end
end

Instance Attribute Details

#ace_modeObject (readonly)

Public: Get Ace mode

Examples

# => "text"
# => "javascript"
# => "c_cpp"

Returns a String name or nil



387
388
389
# File 'lib/linguist/language.rb', line 387

def ace_mode
  @ace_mode
end

#aliasesObject (readonly)

Public: Get aliases

Examples

Language['C++'].aliases
# => ["cpp"]

Returns an Array of String names



360
361
362
# File 'lib/linguist/language.rb', line 360

def aliases
  @aliases
end

#colorObject (readonly)

Public: Get color.

Returns a hex color String.



350
351
352
# File 'lib/linguist/language.rb', line 350

def color
  @color
end

#extensionsObject (readonly)

Public: Get extensions

Examples

# => ['.rb', '.rake', ...]

Returns the extensions Array



401
402
403
# File 'lib/linguist/language.rb', line 401

def extensions
  @extensions
end

#filenamesObject (readonly)

Public: Get filenames

Examples

# => ['Rakefile', ...]

Returns the extensions Array



419
420
421
# File 'lib/linguist/language.rb', line 419

def filenames
  @filenames
end

#interpretersObject (readonly)

Public: Get interpreters

Examples

# => ['awk', 'gawk', 'mawk' ...]

Returns the interpreters Array



410
411
412
# File 'lib/linguist/language.rb', line 410

def interpreters
  @interpreters
end

#nameObject (readonly)

Public: Get proper name

Examples

# => "Ruby"
# => "Python"
# => "Perl"

Returns the name String



340
341
342
# File 'lib/linguist/language.rb', line 340

def name
  @name
end

#search_termObject (readonly)

Deprecated: Get code search term

Examples

# => "ruby"
# => "python"
# => "perl"

Returns the name String



371
372
373
# File 'lib/linguist/language.rb', line 371

def search_term
  @search_term
end

#tm_scopeObject (readonly)

Public: Get the name of a TextMate-compatible scope

Returns the scope



376
377
378
# File 'lib/linguist/language.rb', line 376

def tm_scope
  @tm_scope
end

#typeObject (readonly)

Public: Get type.

Returns a type Symbol or nil.



345
346
347
# File 'lib/linguist/language.rb', line 345

def type
  @type
end

#wrapObject (readonly)

Public: Should language lines be wrapped

Returns true or false



392
393
394
# File 'lib/linguist/language.rb', line 392

def wrap
  @wrap
end

Class Method Details

.[](name) ⇒ Object

Public: Look up Language by its name.

name - The String name of the Language

Examples

Language['Ruby']
# => #<Language name="Ruby">

Language['ruby']
# => #<Language name="Ruby">

Returns the Language or nil if none was found.



230
231
232
# File 'lib/linguist/language.rb', line 230

def self.[](name)
  name && @index[name.downcase]
end

.ace_modesObject

Public: A List of languages compatible with Ace.

TODO: Remove this method in a 5.x release. Every language now needs an ace_mode key, so this function isn’t doing anything unique anymore.

Returns an Array of Languages.



271
272
273
274
# File 'lib/linguist/language.rb', line 271

def self.ace_modes
  warn "This method will be deprecated in a future 5.x release. Every language now has an `ace_mode` set."
  @ace_modes ||= all.select(&:ace_mode).sort_by { |lang| lang.name.downcase }
end

.allObject

Public: Get all Languages

Returns an Array of Languages



126
127
128
# File 'lib/linguist/language.rb', line 126

def self.all
  @languages
end

.by_type(type) ⇒ Object

Detect languages by a specific type

type - A symbol that exists within TYPES

Returns an array



40
41
42
# File 'lib/linguist/language.rb', line 40

def self.by_type(type)
  all.select { |h| h.type == type }
end

.colorsObject

Public: A List of languages with assigned colors.

Returns an Array of Languages.



261
262
263
# File 'lib/linguist/language.rb', line 261

def self.colors
  @colors ||= all.select(&:color).sort_by { |lang| lang.name.downcase }
end

.create(attributes = {}) ⇒ Object

Internal: Create a new Language object

attributes - A hash of attributes

Returns a Language object



49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
# File 'lib/linguist/language.rb', line 49

def self.create(attributes = {})
  language = new(attributes)

  @languages << language

  # All Language names should be unique. Raise if there is a duplicate.
  if @name_index.key?(language.name)
    raise ArgumentError, "Duplicate language name: #{language.name}"
  end

  # Language name index
  @index[language.name.downcase] = @name_index[language.name.downcase] = language

  language.aliases.each do |name|
    # All Language aliases should be unique. Raise if there is a duplicate.
    if @alias_index.key?(name)
      raise ArgumentError, "Duplicate alias: #{name}"
    end

    @index[name.downcase] = @alias_index[name.downcase] = language
  end

  language.extensions.each do |extension|
    if extension !~ /^\./
      raise ArgumentError, "Extension is missing a '.': #{extension.inspect}"
    end

    @extension_index[extension] << language
  end

  language.interpreters.each do |interpreter|
    @interpreter_index[interpreter] << language
  end

  language.filenames.each do |filename|
    @filename_index[filename] << language
  end

  language
end

.detect(blob) ⇒ Object

Public: Detects the Language of the blob.

blob - an object that includes the Linguist ‘BlobHelper` interface;

see Linguist::LazyBlob and Linguist::FileBlob for examples

Returns Language or nil.



104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
# File 'lib/linguist/language.rb', line 104

def self.detect(blob)
  # Bail early if the blob is binary or empty.
  return nil if blob.likely_binary? || blob.binary? || blob.empty?

  # Call each strategy until one candidate is returned.
  STRATEGIES.reduce([]) do |languages, strategy|
    candidates = strategy.call(blob, languages)
    if candidates.size == 1
      return candidates.first
    elsif candidates.size > 1
      # More than one candidate was found, pass them to the next strategy.
      candidates
    else
      # No candiates were found, pass on languages from the previous strategy.
      languages
    end
  end.first
end

.find_by_alias(name) ⇒ Object

Public: Look up Language by one of its aliases.

name - A String alias of the Language

Examples

Language.find_by_alias('cpp')
# => #<Language name="C++">

Returns the Language or nil if none was found.



154
155
156
# File 'lib/linguist/language.rb', line 154

def self.find_by_alias(name)
  name && @alias_index[name.downcase]
end

.find_by_extension(extname) ⇒ Object

Public: Look up Languages by file extension.

extname - The extension String.

Examples

Language.find_by_extension('.rb')
# => [#<Language name="Ruby">]

Language.find_by_extension('rb')
# => [#<Language name="Ruby">]

Returns all matching Languages or [] if none were found.



192
193
194
195
# File 'lib/linguist/language.rb', line 192

def self.find_by_extension(extname)
  extname = ".#{extname}" unless extname.start_with?(".")
  @extension_index[extname]
end

.find_by_filename(filename) ⇒ Object

Public: Look up Languages by filename.

filename - The path String.

Examples

Language.find_by_filename('foo.rb')
# => [#<Language name="Ruby">]

Returns all matching Languages or [] if none were found.



168
169
170
171
172
173
174
175
176
177
# File 'lib/linguist/language.rb', line 168

def self.find_by_filename(filename)
  basename = File.basename(filename)

  # find the first extension with language definitions
  extname = FileBlob.new(filename).extensions.detect do |e|
    !@extension_index[e].empty?
  end

  (@filename_index[basename] + @extension_index[extname]).compact.uniq
end

.find_by_interpreter(interpreter) ⇒ Object

Public: Look up Languages by interpreter.

interpreter - String of interpreter name

Examples

Language.find_by_interpreter("bash")
# => [#<Language name="Bash">]

Returns the matching Language



212
213
214
# File 'lib/linguist/language.rb', line 212

def self.find_by_interpreter(interpreter)
  @interpreter_index[interpreter]
end

.find_by_name(name) ⇒ Object

Public: Look up Language by its proper name.

name - The String name of the Language

Examples

Language.find_by_name('Ruby')
# => #<Language name="Ruby">

Returns the Language or nil if none was found.



140
141
142
# File 'lib/linguist/language.rb', line 140

def self.find_by_name(name)
  name && @name_index[name.downcase]
end

.find_by_shebang(data) ⇒ Object

DEPRECATED



198
199
200
# File 'lib/linguist/language.rb', line 198

def self.find_by_shebang(data)
  @interpreter_index[Shebang.interpreter(data)]
end

Public: A List of popular languages

Popular languages are sorted to the top of language chooser dropdowns.

This list is configured in “popular.yml”.

Returns an Array of Languages.



242
243
244
# File 'lib/linguist/language.rb', line 242

def self.popular
  @popular ||= all.select(&:popular?).sort_by { |lang| lang.name.downcase }
end

.unpopularObject

Public: A List of non-popular languages

Unpopular languages appear below popular ones in language chooser dropdowns.

This list is created from all the languages not listed in “popular.yml”.

Returns an Array of Languages.



254
255
256
# File 'lib/linguist/language.rb', line 254

def self.unpopular
  @unpopular ||= all.select(&:unpopular?).sort_by { |lang| lang.name.downcase }
end

Instance Method Details

#==(other) ⇒ Object



493
494
495
# File 'lib/linguist/language.rb', line 493

def ==(other)
  eql?(other)
end

#default_alias_nameObject

Internal: Get default alias name

Returns the alias name String



453
454
455
# File 'lib/linguist/language.rb', line 453

def default_alias_name
  name.downcase.gsub(/\s/, '-')
end

#eql?(other) ⇒ Boolean

Returns:

  • (Boolean)


497
498
499
# File 'lib/linguist/language.rb', line 497

def eql?(other)
  equal?(other)
end

#escaped_nameObject

Public: Get URL escaped name.

Examples

"C%23"
"C%2B%2B"
"Common%20Lisp"

Returns the escaped String.



446
447
448
# File 'lib/linguist/language.rb', line 446

def escaped_name
  EscapeUtils.escape_url(name).gsub('+', '%20')
end

#groupObject

Public: Get Language group

Returns a Language



460
461
462
# File 'lib/linguist/language.rb', line 460

def group
  @group ||= Language.find_by_name(@group_name)
end

#hashObject



501
502
503
# File 'lib/linguist/language.rb', line 501

def hash
  name.hash
end

#inspectObject



505
506
507
# File 'lib/linguist/language.rb', line 505

def inspect
  "#<#{self.class} name=#{name}>"
end

#popular?Boolean

Public: Is it popular?

Returns true or false

Returns:

  • (Boolean)


467
468
469
# File 'lib/linguist/language.rb', line 467

def popular?
  @popular
end

#primary_extensionObject

Deprecated: Get primary extension

Defaults to the first extension but can be overridden in the languages.yml.

The primary extension can not be nil. Tests should verify this.

This method is only used by app/helpers/gists_helper.rb for creating the language dropdown. It really should be using ‘name` instead. Would like to drop primary extension.

Returns the extension String.



433
434
435
# File 'lib/linguist/language.rb', line 433

def primary_extension
  extensions.first
end

#searchable?Boolean

Public: Is it searchable?

Unsearchable languages won’t by indexed by solr and won’t show up in the code search dropdown.

Returns true or false

Returns:

  • (Boolean)


484
485
486
# File 'lib/linguist/language.rb', line 484

def searchable?
  @searchable
end

#to_sObject

Public: Return name as String representation



489
490
491
# File 'lib/linguist/language.rb', line 489

def to_s
  name
end

#unpopular?Boolean

Public: Is it not popular?

Returns true or false

Returns:

  • (Boolean)


474
475
476
# File 'lib/linguist/language.rb', line 474

def unpopular?
  !popular?
end